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Nucleic acids encoding an ISUM2A polypeptide and 
use of said nucleic acids to obtain 
5 transformed plants producing grains 

altered in germ development 

FIELD OF THE INVENTION 

10 

The present invention relates to the field of improving the 
agronomic characteristics of plants for the purpose of obtaining plant 
transformation products with improved characteristics for industry, most 
particularly for the agrofoods industries, for example for the production of 
is starch, of plant oil or of semolina. 

STATE OF THE ART 

In general, there exists a need, in the state of the art, to 

20 improve the agronomic, dietary or industrial qualities of grains, in 
particular for industries producing starch, plant oil or semolina. 

The grain embryo is the compartment of the grain from which 
plant oil is extracted. It would be industrially advantageous to obtain 
plants in which the grains have an overdeveloped embryo rich in oil. 

25 Conversely underdeveloped embyros would be particularly 

advantageous for the production of semolina. 

It would also be advantageous for the starch industry to obtain 
plants in which the grains lack germs, the grains consisting essentially of 
the starch-rich albumen. 

30 In general, there exists a need, in the state of the art, for the 

isolation and characterization of regulatory sequences allowing specific 
expression of a nucleic acid of interest in the embryo and/or the 
albumen, early in the course of the plant's development, in order to 
modulate the agronomic qualities of the mature grain. 

35 The amount of starch produced today, for multiple industrial 

applications, is approximately 1 billion tonnes annually throughout the 
world. Starch represents a raw material which is used in industries as 
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diverse as the food industry, pharmacy and the paper industries, but also 
in the microbiological field, where it can be used as a nutritive substrate. 

Conventionally, starch is obtained from the grains of large crop 
cereal plants such as wheat, maize and sorghum. 
5 The grains consist essentially of a germ associated with the 

albumen. The starch is entirely contained in the albumen, while the germ 
is rich in oil. As a result of this, the processes for preparing starch used 
in the starch industry necessarily comprise a step for separating the 
germ, which is rich in oil, from the albumen which contains an 

10 abundance of the starch constituting the raw material of interest. 

It would therefore be particularly advantageous technically to 
eliminate the step separating the germ from the albumen in starch 
purification processes, which would make it possible to significantly 
simplify these processes and also to make them more rapid and less 

15 expensive. 

There also exists the need, in the state of the art, to improve 
the agronomic qualities of the grains used in processes for converting 
the vitreous portion of the maize grain (kernel) into semolina. The milling 
process makes it possible to finely separate the various constituents of 

20 the grain in order to satisfy users' requirements. It comprises in particular 
a drying step which should be as gentle as possible in order to avoid 
migration of the lipids from the germs to the kernel, which is prejudicial to 
the subsequent quality of the semolinas, and a degerming step, carried 
out by fragmentation, the aim of which is to carefully separate the germs 

25 in order to avoid the lipids that they contain ending up in the semolinas. 

The use in the semolina industry of the grains altered in germ 
development according to the invention therefore presents a definite 
advantage since it facilitates the industrial process (gain in time and 
yield) by eliminating this 'contamination* of the kernel with the lipids from 

30 the germ; in addition, it ensures good quality and a good lifespan of the 
products, this being a function of the residual of fats (<0.8% is a 
standard in the profession for 'noble' products: hominy, grits, semolinas 
and flours for human food). 
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The semolinas produced are then used almost exclusively in 
human foodstuffs (beers, breakfast cereals, crackers, polenta, tortilla, 
corn chips, food flours, etc). 

By way of examples, the grains according to the invention may 

5 be used for the production of breakfast cereals or 'cornflakes', which 
constitute a market which has, for 15 years, been experiencing an 
annual average increase of the order of 20%. Two processes are 
conventionally used: a conventional rolling process using hominy (the 
largest semolinas completely degermed and calibrated) or a process by 

10 cooking-extrusion using semolinas with a specific particle size. 

According to another embodiment of the invention, it is possible 
to obtain grains with large embryos, which is desired in the oil industry. 
Corn oil is generally intended for human food; it is sometimes used by 
the pharmaceutical industry and the cosmetics industry. The cakes are, 

is themselves, exploited in animal feedstuffs either directly, or else remixed 
with corn-gluten feed. 

Many plants carrying mutations affecting both the embryo and 
the albumen of the grain are known in the state of the art. These mutant 
plants are conventionally referred to as "dek" (for "defective kernel") 

20 mutants. 

On the other hand, there are very few maize mutants which are 
altered by a deficiency in the development of the embryo of the grain but 
which leaves the albumen intact. 

These are essentially the mutant plants observed by Sheridan 
25 et al. (1993, 1995) and those described by Elster et al. (2000). However, 
these authors provide no result regarding cosegregation of genetic 
markers with the mutant phenotype. 

Heckel et al. (1999) have analyzed five maize mutants affected 
at various stages of embryo development. 
30 A first group of mutants comprises mutants which produce pro- 

embryonic structures resembling those produced in the wild-type plants, 
but the pro-embryos do not reach the subsequent developmental stages. 

In the second group of mutants, the mutants emb*-8522 and 
emb*-8535 are altered by a complete deficiency in apical-basal 
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differentiation, whereas, in the mutant emb*-8576, the authors have 
observed embryo-like structures arising from the suspensor. 

An analysis of cosegregation of the emb*-8516 and emb*-8522 
mutations enabled Heckel et al. to determine that the mutant phenotype 
5 cosegregated with the presence of a transposon. However, the 
molecular characterization of the various mutations was not described. In 
particular, the mutation affecting the emb*-8516 mutant plant could not 
be located on any of the maize chromosomes. 

10 SUMMARY OF THE INVENTION 

The invention provides nucleic acid sequences and 
polypeptides, the expression of which is essential for embryo 
development in a plant grain, and a deficiency in the expression of which 
is leads to the production of grains altered in germ development. 

A subject of the invention is a nucleic acid comprising a 
polynucleotide encoding an ISUM2A polypeptide, chosen from the 
sequences having at least 95% amino acid identity with the sequences 
SEQ ID No. 5 and SEQ ID No. 6, or encoding a fragment of an ISUM2A 
20 polypeptide, and also a nucleic acid of complementary sequence. 

Preferably, the nucleic acid encoding the ISUM2A polypeptide, 
or encoding a fragment of this polypeptide, also comprises a regulatory 
polynucleotide capable of regulating the synthesis of the ISUM2A 
polypeptide, the regulatory polynucleotide preferably being sensitive to 
25 the action of an inducing signal. 

The regulatory polynucleotide may be equally a transcription- or 
translation-repressing polynucleotide or, on the contrary, a transcription- 
or translation-activating polynucleotide. 

The invention also relates to methods for obtaining a 
30 transformed plant capable of producing grains with altered germ 
development, and also parts of such a plant, in particular its seeds. 

A subject of the invention is also a product of transformation of 
a seed with altered germ development produced by said transformed 
plant, preferably a starch. 
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A subject of the invention is also the ISUM2A polypeptide, or a 
fragment of this polypeptide, encoded by a nucleic acid as defined 
above, and also antibodies directed against the ISUM2A polypeptide. 

The invention also relates to methods for detecting the 
5 presence of the ISUM2A polypeptide in a sample, and to methods for 
detecting the presence of a nucleic acid encoding this polypeptide in a 
sample. 

DETAILED DESCRIPTION OF THE INVENTION 

10 

According to the invention, a population of 25 000 plants having 
been subject to a high degree of mutagenesis by the random insertion, 
into their genome, of the mutator transposon described by Bennetzen, 
J.L. P.S. Springer, A.D. Cresse, and M. Hendrickx (1993), Chandler, V.L. 
15 and K.J. Hardeman (1992) has been generated. The plants were placed 
in culture before analyzing their DNA. 

For one of the mutant plants, designated G2422, it was shown 
that the insertion of the mutator transposon was located in the first intron 
of a particular gene, this gene having been designated isum2A, at a 
20 distance of 3 bp from the second exon of this gene. 

On another mutant plant, the inventors observed cosegregation 
between the presence of a "germ-free grains" phenotype and the 
insertion of the muta tor transposon into this same isum2A gene. 

The isum2A gene, which is shown, according to the invention, 
25 to be necessary for the normal development of the embryo of the grain, 
was then isolated and completely characterized. 

The isum2A gene comprises three exons and two introns. The 
open reading frame begins in the first exon and ends in the third exon of 
this gene. 

30 A partial product of transcription of the isum2A gene was also 

isolated and characterized, and the structure of a large part of the 
ISUM2A protein was deduced from the cDNA corresponding to the 
transcription product of the gene. 
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It has been shown, according to the invention, that the isum2A 
gene is expressed in the embryo and the albumen 12 days after 
pollination, and also in the leaves and the roots. 

In addition, the applicant has shown that insertion of the 
5 mutator transposon into intron No. 2 of the gene caused blocking of the 
expression of the isum2A gene, since the presence of its transcription 
product was no longer detectable in an albumen of a plant homozygous 
for the interruption of the isum2A gene (isum2A::Mu/isum2A::Mu). 

It has thus been shown, according to the invention, that plants 
10 homozygous for mutation in the gene encoding the ISUM2A polypeptide 
produce grains consisting essentially of the albumen, and having altered 
germ development. The germ is also called embryo. 

The applicant has also shown that the seeds representing a 
quarter of the grains produced by heterozygous plants, and in which both 
15 copies of the isum2A gene comprise a mutation, do not allow a viable 
and fertile plant to be obtained. 

It has more particularly been shown, according to the invention, 
that the seeds representing a quarter of the seeds produced by plants 
heterozygous for the mutated allele(s) resulting in the recessive mutation 
20 of the "emb" type would not make it possible to obtain a viable and fertile 
plant. 

In order to remedy this drawback, also provided according to 
the invention are complementation systems which allow the expression 
of a nucleic acid encoding a functional ISUM2A polypeptide in plants in 
25 which the two alleles of the isum2a gene are mutated, it being possible 
for these complementation systems to be made inducible so as to 
precisely control, over time, the expression of a nucleic acid encoding an 
ISUM2A polypeptide. 

It is thus possible, by virtue of the invention, to obtain viable 
30 plants which produce grains with altered germ development, it being 
possible for these grains to be directly used in industry, without a step for 
separating the embryo, most particularly in the starch industry and in the 
semolina industry. 

According to another aspect, the invention also provides the 
35 means for obtaining plants which produce grains altered in germ 
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development, in which the germ is enriched in oil due to early expression 
or overexpression of the ISUM2A polypeptide. 

The invention relates to nucleotide sequences involved in 
embryo development, and to their use in molecular constructs intended 
5 to improve the agronomic, food or industrial quality of a plant, modulating 
in particular the size of the embryo and/or its development. 

In fact, an early and specific action on the development of the 
tissues of the embryo and of the albumen may be desired: 

1) According to a first embodiment, it will be possible to obtain grains or 
10 fruits enriched in oil (large embryo), via the use of promoters which 

direct the expression of the nucleic acids according to the invention 
early in the development of the grain, and more particularly in the 
germ, or constitutively. 

2) According to another embodiment, albumens with altered germ 
is development could also be obtained according to this model, for 

industrial applications in starch production and semolina production. 

The invention is therefore also directed toward methods for modifying 
the agronomic and/or nutritional qualities of a plant, by a targeted and 
early action on the development of the embryo, using transformation of 
20 the plants with a vector according to the invention. In particular, it is 
directed toward modifying the size and/or the development of the 
embryo. It is also directed toward altering the development of the 
embryo, for the purpose of producing embyro-free grains for cereals in 
particular, which are of value for the starch and semolina industries. 
25 A subject of the invention is also the use of an expression cassette as 
defined above, for obtaining a transgenic angiosperm plant exhibiting 
improved agronomic or nutritional qualities. 

Advantageously, the transgenic plant obtained can produce grains 
with modified oil contents or with altered germ development, in 
30 comparison with a nontransformed plant. 

The invention also relates to the use of the transgenic plants obtained 
according to the invention, or parts of these plants, in particular seeds, 
grains and fruits, for preparing derived products, in particular food 
products. 
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Also part of the invention are the products obtained, whether they are 
seeds, seed meals or grains enriched in oil or seeds with altered germ 
development, which are suitable for the semolina industry. 

The invention also relates to any composition for human or animal 
5 food prepared from said obtained products. 

According to another aspect, the invention relates to the use of 
the sequences and allelic variants defined according to the invention, in 
selection programs aimed at producing plants with an embryo modified in 
terms of size and/or development having an influence on the starch/oil 
10 content. These sequences can in particular be used in experiments for 
mapping and colocalizing QTLs for the oil content or the size of the 
embryo, in order to define the most advantageous allelic variants for a 
selection approach, comprising: 

- genotyping individuals by means of nucleic acid probes or 
is primers obtained from sequences and allelic variants 

described according to the invention; 

- selecting, from these individuals, plants which comprise a high 
frequency of favorable alleles associated with the size and/or 
with the development of the embryo. 

20 isum2A GENE NUCLEIC ACIDS 

The nucleotide sequence of the isum2A gene comprises, from 
the 5' end to the 3' end, (i) a noncoding upstream sequence potentially 
carrying regulatory elements for transcription and/or translation of the 
gene, (ii) a coding sequence comprising the three exons and the two 

25 introns of the gene and (iii) a noncoding sequence located downstream 
of the final exon of the gene. 

The sequence of the isum2A gene according to the invention is 
referenced as the sequence SEQ ID No. 1 of the sequence listing. 

Analysis of a population of plants having the wild-type 

30 phenotype and producing grains containing a normal embryo has made it 
possible to identify at least two variants of the isum2A gene. One of the 
variants of the gene is characterized by the substitution of the G 
nucleotide located at position 2234 of the sequence SEQ ID No. 1 with a 
C nucleotide. This nucleotide substitution leads to the substitution of the 

35 amino acid G (glycine) at position 89 of the ISUM2A polypeptide 
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sequence SEQ ID No. 5 with an amino acid R (asparagine) which is 
present in the sequence of the variant ISUM2A polypeptide, SEQ ID N°6. 

Thus, a first subject of the invention consists of a nucleic acid 
comprising a polynucleotide encoding an ISUM2A polypeptide chosen 
5 from the sequences having at least 95% amino acid identity with the 
sequences SEQ ID No. 5 and SEQ ID No. 6, or encoding a fragment of 
an ISUM2A polypeptide. 

The invention also relates to a nucleic acid of sequence 
complementary to the nucleic acid as defined above. 

10 According to the invention, any conventional molecular biology, 

microbiology and recombinant DNA techniques known to those skilled in 
the art may be used. Such techniques are described, for example, by 
SAMBROOK et al. (1989), GLOVER (1985), GAIT (1984), HAMES and 
HIGGINS (1984), BERBAL (1984) and AUSUBEL et al. (1994) . 

15 Preferably, any nucleic acid and any polypeptide according to 

the invention is in an isolated or purified form. 

For the purpose of the present invention, the term "isolated" 
denotes a biological material which has been removed from its original 
environment (the environment in which it is naturally located). For 

20 example, a polynucleotide present in the natural state in a plant is not 
isolated. The same polynucleotide separated from the adjacent nucleic 
acids into which it is naturally inserted in the genome of the plant is 
isolated. Such a polynucleotide may be included in a vector, and/or such 
a polynucleotide may be included in a composition, and nevertheless 

25 remain in the isolated state due to the fact that the vector or the 
composition does not constitute its natural enviromment. 

The term "purified" does not require the material to be present 
in a form of absolute purity, excluding the presence of other compounds. 
It is rather a relative definition. 

30 A polynucleotide or a polypeptide is in the purified state after 

purification of the starting material or of the natural material by at least 
one order of magnitude, preferably 2 or 3, and preferentially four or five 
orders of magnitude. 

For the purposes of the present invention, the expression 

35 "nucleotide sequence" may be used to denote equally a polynucleotide 
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or a nucleic acid. The expression "nucleotide sequence" encompasses 
the genetic material itself and is not therefore restricted to the 
information concerning its sequence. 

The terms "nucleic acid", "polynucleotide", "oligonucleotide" or 
5 alternatively " nucleotidique sequence" encompass RNA, DNA or cDNA 
sequences, or else RNA/DNA hybrid sequences, of more than one 
nucleotide, equally in single-stranded form or in the form of a duplex. 

The term "nucleotide" denotes both natural nucleotides (A, T, G, 
C) and modified nucleotides which comprise at least one modification 
10 such as (i) a purine analog, (ii) a pyrimidine analog, or (iii) a sugar 
analog, such modified nucleotides being described, for example, in PCT 
application No. WO 95/04064. 

For the purposes of the present invention, a first polynucleotide 
is considered to be "complementary" to a second polynucleotide when 
15 each base of the first nucleotide is paired with the complementary base 
of the second polynucleotide, the orientation of which is inverted. The 
complementary bases are A and T (or A and U), and C and G. 

According to the invention, a first nucleic acid having at least 
95% identity with a reference second nucleic acid would have at least 
20 95%, preferably at least 96%, 97%, 98%, 98.5%, 99%, 99.1%, 99.2%, 
99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8% or 99.9% nucleotide 
identity with this reference second polynucleotide, the percentage 
identity between two sequences being determined as described below. 

For the purpose of the present invention, the "percentage 
25 identity" between two nucleotide or amino acid sequences can be 
determined by comparing two optimally aligned sequences, through a 
window of comparison. 

The part of the nucleotide or polypeptide sequence in the 
window of comparison may thus comprise additions or deletions (for 
30 example "gaps") compared to the reference sequence (which does not 
comprise these additions or these deletions) so as to obtain an optimal 
alignment of the two sequences. 

The percentage is calculated by determining the number of 
positions at which an identical nucleic acid base or amino acid residue is 
35 observed for the two compared (nucleic acid or peptide) sequences, then 
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dividing the number of positions at which there is identity between the 
two compared bases or amino acid residues by the total number of 
positions in the window of comparison, and then multiplying the result by 
one hundred in order to obtain the percentage sequence identity. 
5 The optimal alignment of the sequences for the comparison 

may be carried out on a computer using known algorithms. 

Preferably, the percentage sequence identity is determined by 
means of the BLAST software (BLAST version 2.06 from September 
1 998), using exclusively the default parameters. 

10 A nucleic acid having at least 95% nucleotide identity with a 

nucleic acid according to the invention encompasses the "variants" of a 
nucleic acid according to the invention. 

The term "variant" of a nucleic acid according to the invention is 
intended to mean a nucleic acid which differs from the reference nucleic 

15 acid by one or more substitutions, additions or deletions of a nucleotide, 
compared to the reference nucleic acid. A variant of a nucleic acid 
according to the invention may be of natural origin, such as an allelic 
variant which exists naturally. Such a variant nucleic acid may also be an 
unnatural nucleic acid obtained, for example, by mutagenesis 

20 techniques. 

In general, the differences between the reference nucleic acid 
and the "variant" nucleic acid are small such that the reference nucleic 
acid and the variant nucleic acid are nucleotide sequences which are 
very similar and, in many regions, identical. The nucleotide modifications 

25 present in a variant nucleic acid may be silent, which means that they do 
not affect the amino acid sequence which may be encoded by this 
variant nucleic acid. 

The nucleotide modifications in the variant nucleic acid may 
also result in substitutions, additions or deletions of one or more amino 

30 acids in the sequence of the polypeptide which may be encoded by this 
variant nucleic acid. 

Entirely preferably, a variant nucleic acid according to the 
invention comprising an open reading frame encodes a polypeptide 
which conserves the same biological function or the same biological 

35 activity as the polypeptide encoded by the reference nucleic acid. 



12 



Entirely preferably, a variant nucleic acid according to the 
invention which comprises an open reading frame encodes a polypeptide 
which conserves the ability to be recognized by antibodies directed 
against the polypeptide encoded by the reference nucleic acid. 
5 The nucleic acids of the genes orthologous to ISUM2 included 

in the genome of plants other than maize, and having a nucleotide 
identity of at least 95% with a nucleic acid encoding the ISUM2A 
polypeptide, are part of the "variants" of a nucleic acid encoding the 
ISUM2A polypeptide. 

10 The "fragment" of a nucleic acid according to the invention is 

intended to mean a nucleotide sequence which is shorter compared to 
the reference nucleic acid, the nucleic acid fragment having a nucleotide 
sequence identical to the nucleotide sequence of the reference nucleic 
acid over the common portion. Such fragments of a nucleic acid 

15 according to the invention have at least 12, 15, 18, 20, 25, 30, 35, 40, 
45, 50, 60, 70, 80, 90, 100, 150, 200, 300, 400, 500, 1000, 2000 or 3000 
consecutive nucleotides of the reference nucleic acid, the maximum 
length in nucleotides of a fragment of a nucleic acid according to the 
invention being, of course, limited by the maximum length in nucleotides 

20 of the reference nucleic acid. 

The term "fragment" of an ISUM2A polypeptide according to the 
invention is intended to mean a polypeptide fragment which is shorter 
compared to the reference polypeptide, the polypeptide fragment having 
an amino acid sequence identical to the amino acid sequence of the 

25 reference polypeptide over the common portion. Such fragments of an 
ISUM2A polypeptide according to the invention have at least 5, 10, 15, 
20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 120, 130, 135 or 140 consecutive 
amino acids of a reference ISUM2A polypeptide. 

30 isum2A GENOMIC NUCLEIC ACIDS 

As indicated above, two allelic variants of the genomic nucleic 
acid of the isum2A gene have been characterized according to the 
invention, the two variant genomic nucleic acids being respectively 
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referenced as the nucleotide sequences SEQ ID No. 1 and SEQ ID No. 
2 of the sequence listing. 

Consequently, a subject of the present invention is a nucleic 
acid encoding an ISUM2A polypeptide, or encoding a fragment of this 
5 polypeptide, said nucleic acid comprising a polynucleotide having at least 
95% nucleotide identity with a nucleotide sequence chosen from SEQ ID 
No. 1 and SEQ ID No. 2, or with a fragment of one of the sequences 
SEQ ID No. 1 and SEQ ID No. 2. 

Also part of the invention is a nucleic acid of sequence 
10 complementary to the nucleic acid as defined above. 

Another subject of the invention is a nucleic acid consisting of a 
polynucleotide having at least 95% nucleotide identity with a sequence 
chosen from the sequences SEQ ID No. 1 and SEQ ID No. 2, or with a 
fragment of one of the sequences SEQ ID No. 1 or SEQ ID No. 2, or a 
is nucleic acid of complementary sequence. 

The invention also relates to a nucleic acid comprising at least 
12, preferably at least 15, and entirely preferably at least 20, consecutive 
nucleotides of the nucleic acid of sequence SEQ ID No. 1 or SEQ ID 
No. 2, it being understood that such a nucleic acid encompasses in its 
20 definition the "fragments" of a nucleic acid according to the invention as 
defined in the present description. 

The isum2A gene, defined by the sequence SEQ ID No. 1, 
comprises, from the 5* end to the 3' end, respectively: 

a) a noncoding sequence potentially carrying regulatory 
25 elements for transcription and/or translation of this gene, located 

upstream of the first exon, from the nucleotide at position 1 to the 
nucleotide at position 1812 of the sequence SEQ ID No. 1; 

b) a "coding" region which comprises the three exons and the 
two introns of the isum2A gene, this coding region being located from the 

30 nucleotide at position 1813 to the nucleotide at position 4074 of the 
sequence SEQ ID No. 1; and 

c) a noncoding region located downstream of the coding region, 
from the nucleotide at position 4075 to the nucleotide at position 5620 of 
the sequence SEQ ID No. 1. 
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The sequence SEQ ID No. 2, which illustrates a variant of the 
isum2A gene, does not contain the entire open reading frame encoding 
an ISUM2A polypeptide. The sequence SEQ ID No. 2 comprises a 
portion located on the 3' side of exon No. 1 , and also all of exons No. 2 
5 and No. 3. The nucleotide at position 1 of the sequence SEQ ID No. 2 
corresponds to the nucleotide at position 2063 of the sequence SEQ ID 
No. 1. The nucleotide at position 2004 of the sequence SEQ ID No. 2 
corresponds to the nucleotide at position 4062 of the sequence SEQ ID 
No. 1. 

10 However, knowledge of the sequence SEQ ID No. 2 by those 

skilled in the art and comparison thereof with sequence SEQ ID No. 1, 
makes it possible to directly obtain the entire coding sequence of the 
isum2A gene defined by the sequence SEQ ID No. 2, for example by 
completing the missing bases of exon No. 1 of the sequence SEQ ID 

15 No. 2 with the corresponding bases of exon No. 1 of the sequnce SEQ 
ID No. 1, using as a basis the corresponding positions given above and 
also in Table 1 below. 

Similarly, those skilled in the art can obtain the nucleic acid of 
the isum2A gene, for example by isolating, from the information of the 

20 sequence SEQ ID No. 1, the equivalent nucleotide regions or else by 
producing a hybrid nucleic acid obtained by fusing the nucleic acids of 
sequences SEQ ID No. 1 and SEQ ID No. 2 on the basis of the 
corresponding positions given above and also in Table 1 below. 

The structural characteristics of the three introns and of the two 

25 exons of the ISUM2A gene are given in detail in Table 1 below. 

TABLE 1 

Sequences of the exons of the lsum2A gene 



Exon 
No. 


Position of the nucleotide in the 
5' position on 

SEQ ID No. 1 SEQ ID No. 2 


Position of the nucleotide in the 
3' position on 

SEQ ID No. 1 SEQ ID No. 2 


1 


1813 


< 1 


2099 


37 


2 


2207 


146 


2323 


262 


3 


3646 


1588 


4074 


>2004 
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The invention also relates to a nucleic acid comprising at least 
12 consecutive nucleotides of an exonic polynucleotide of the lsum2A 
gene, such as the polynucleotides 1 to 3 described in Table 1 above, 
which are included in the nucleic acid of sequence SEQ ID No. 1 and 
5 SEQ ID No. 2. 

Such a nucleic acid encodes at least part of the polypeptide 
encoded by the lsum2A gene and can in particular be inserted into a 
recombinant vector intended for the expression of the corresponding 
translation product in a host cell or in a plant transformed with this 
10 recombinant vector. 

Such a nucleic acid may also be used for synthesizing 
nucleotide probes and primers intended for the detection or for the 
amplification of nucleotide sequences included in the lsum2A gene in a 
sample, where appropriate, of sequences of the lsum2A gene carrying 
15 one or more mutations, preferably one or more mutations which are such 
that they modify the phenotype of a plant carrying such a mutated 
lsum2A gene, by causing the production of grains altered in germ 
development. 

20 TABLE 2 

Sequences of the introns of the lsum2A gene 



Intron 


Position of the nucleotide in the 


Position of the nucleotide in the 


No. 


5' position on 




3' position on 






SEQ ID No. 1 


SEQ ID No. 2 


SEQ ID No. 1 


SEQ ID No. 2 


1 


2100 


38 


2206 


145 


2 


2324 


263 


3645 


1587 



The invention also relates to a nucleic acid comprising at least 
25 12 consecutive nucleotides of an intronic polynucleotide of the lsum2A 
gene, such as the polynucleotides 1 and 2 described in Table 2 above, 
which are included in the nucleic acid of sequence SEQ ID No. 1 and 
SEQ ID No. 2. 

Such a nucleic acid may be used as an oligonucleotide probe or 
30 primer for detecting the presence of at least one copy of the lsum2A 
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gene in a sample, or else for amplifying a given target sequence in the 
lsum2A gene. 

Such a nucleic acid may also be used for amplifying a given 
target sequence in the isum2A gene or inhibiting it by a sense or 
5 cosuppression approach, or using double-stranded RNA (Wassenegger 
et al. 1996; Kooter et al. 1999) for interference. Such a nucleic acid may 
also be used for searching for functional allelic variants of the ISUM2 
gene, which may be used in a method for selecting plants with an 
embryo modified in terms of size and/or development. 
10 The genomic region of the lsum2A gene essentially restricted to 

the "coding" region comprising the 3 exons and the 2 introns is defined 
as the sequence beginning at the nucleotide at position 1813 and ending 
at the nucleotide at position 4074 of the sequence SEQ ID No. 1 . 

Part of exon No. 1 , all of exons No. 2 and 3 and also the two 
15 introns of a variant of the isum2A gene are included in the sequence 
ranging from the nucleotide at position 1 to the nucleotide at position 
2004 of the sequence SEQ ID No. 2. 

It is specified that, over their common region, the sequences 
SEQ ID No. 1 and SEQ ID No. 2 have a percentage nucleotide identity of 
20 greater than 95%, this percentage in fact being greater than 99%. 

A subject of the invention is also a nucleic acid comprising a 
polynucleotide having at least 95% nucleotide identity with the nucleotide 
sequence beginning at the nucleotide at position 1813 and ending at the 
nucleotide at position 4074 of the sequence SEQ ID No. 1, and also a 
25 nucleic acid of complementary sequence. 

The invention also relates to a nucleic acid having at least 95% 
nucleotide identity with the nucleotide sequence beginning at the 
nucleotide at position 1813 and ending at the nucleotide at position 4074 
of the sequence SEQ ID No. 1, and also a nucleic acid of 
30 complementary sequence. 

A subject of the invention is also a nucleic acid comprising the 
nucleotide sequence beginning at the nucleotide in position 1813 and 
ending at the nucleotide at position 4074 of the sequence SEQ ID No. 1 
or a nucleic acid of complementary sequence. 



17 



The invention also relates to a nucleic acid consisting of the 
nucleotide sequence beginning at the nucleotide at position 1813 and 
ending at the nucleotide at position 4074 of the sequence SEQ ID No. 1 
or a nucleic acid of complementary sequence. 
5 Another subject of the invention consists of a nucleic acid 

characterized in that it comprises one of the following nucleotide 
sequences: 

a) the sequence ranging from the nucleotide at position 1 to the 
nucleotide at position 1812 of the sequence SEQ ID No. 1 f or a nucleic 

10 acid of complementary sequence; 

b) the sequence ranging from the nucleotide at position 1813 to 
the nucleotide at position 2099 of the sequence SEQ ID No. 1, or a 
nucleic acid of complementary sequence; 

c) the sequence ranging from the nucleotide at position 2100 to 
15 the nucleotide at position 2206 of the sequence SEQ ID No. 1, or a 

nucleic acid of complementary sequence; 

d) the sequence ranging from the nucleotide at position 2207 to 
the nucleotide at position 2323 of the sequence SEQ ID No. 1, or a 
nucleic acid of complementary sequence; 

20 e) the sequence ranging from the nucleotide at position 2324 to 

the nucleotide at position 3645 of the sequence SEQ ID No. 1, or a 
nucleic acid of complementary sequence; 

f) the sequence ranging from the nucleotide at position 3646 to 
the nucleotide at position 4074 of the sequence SEQ ID No. 1, or a 

25 nucleic acid of complementary sequence; and 

g) the sequence ranging from the nucleotide at position 4075 to 
the nucleotide as position 5620 of the sequence SEQ ID No. 1, or a 
nucleic acid of complementary sequence. 

The nucleic acid of sequence SEQ ID No. 1 is represented in 
30 Figure 1 , in which details are also given of the positions of the various 
exons and introns of the lsum2A gene. 

PRODUCTS OF TRANSCRIPTION OF THE lsum2A GENE 
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It has been shown, according to the invention, that the lsum2A 
gene is transcribed in the form of a messenger RNA. This messenger 
RNA comprises an open reading frame encoding the ISUM2>4 protein. 

The part of the cDNA of the lsum2A gene comprising the open 
5 reading frame encoding the ISUM2A polypeptide of sequence SEQ ID 
No. 5 is 833 nucleotides in length and is referenced as the sequence 
SEQ ID No. 3 of the sequence listing. The cDNA of sequence SEQ ID 
No. 3 derives from the wild-type plant variant denoted HD5xHD7. This 
cDNA is represented in Figure 2. 
10 The part of the cDNA of the lsum2A gene comprising part of the 

open reading frame encoding the ISUM2A polypeptide of sequence SEQ 
ID No. 6 is 621 nucleotides in length and is referenced as the sequence 
SEQ ID No. 4 of the sequence listing. The cDNA of sequence SEQ ID 
No. 4 derives from the wild-type plant variant denoted A188. This partial 
15 cDNA is represented in Figure 3. 

The nucleic acids of sequences SEQ ID No. 3 and SEQ ID 
No. 4 exhibit similarities with an EST sequence referenced in the 
GENBANK database under the accession number AI001298 and 
denoted "ISUM2". For this reason, the newly discovered gene according 
20 to the invention has been given the name "isum2A". 

The sequence of the EST No. AI001298 has a degree of 
nucleotide identity of 94% and 95% with the sequences SEQ ID No. 3 
and SEQ ID No. 4, respectively. No open reading frame is described for 
this EST. 

25 The nucleic acids of sequences SEQ ID No. 3 and SEQ ID 

No. 4 also exhibit similarities with an EST sequence referenced in the 
GENBANK database under the accession number AI374506. This 
sequence was obtained from the same clone "MEST6-D3" as the 
sequence Al 001298. The sequence of the EST No. AI374506 has a 

30 degree of nucleotide identity of 92% and 98% with the sequences SEQ 
ID No. 3 and SEQ ID No. 4, respectively. No open reading frame is 
described for this EST. 

Another subject of the invention consists of a nucleic acid 
comprising a polynucleotide encoding an ISUM2A polypeptide and 

35 having at least 99% nucleotide identity with the nucleotide sequence 
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SEQ ID No. 3 or SEQ ID No. 4, or with a fragment of this nucleotide 
sequence, and also a nucleic acid of sequence complementary to these 
nucleic acids. 

The invention also relates to a nucleic acid encoding an 
5 ISUM2A polypeptide and having at least 99% nucleotide identity with the 
nucleotide sequence SEQ ID No. 3 or SEQ ID No. 4, or a fragment of 
this nucleotide sequence, and also a nucleic acid of sequence 
complementary to these nucleic acids. 

A subject of the invention is also a nucleic acid characterized in 
10 that it comprises the nucleotide sequence SEQ ID No. 3 or SEQ ID No. 

4 or a nucleic acid of complementary sequence. 

Also part of the invention is a nucleic acid consisting of the 
nucleotide sequence SEQ ID No. 3 or SEQ ID No. 4, or a nucleic acid of 
complementary sequence, 
is The lsum2A gene encodes a polypeptide 143 amino acids in 

length, two allelic variants of which have been identified according to the 
invention, respectively the variant polypeptides of sequence SEQ ID No. 

5 and SEQ ID No. 6, which have a degree of identity of more than 99% 
with one another. 

20 Consequently, the invention also relates to a nucleic acid 

encoding a polypeptide having at least 95% amino acid identity with the 
sequence SEQ ID No. 5 or SEQ ID No. 6. 

The invention also relates to a nucleic acid characterized in that 
it encodes the polypeptide of sequence SEQ ID No. 5 or SEQ ID No. 6. 

25 Also part of the invention is a nucleic acid encoding a 

"fragment" of a polypeptide having at least 95% nucleotide identity with a 
polypeptide of amino acid sequence SEQ ID No. 5 or SEQ ID No. 6. 

The invention also relates to a nucleic acid encoding a fragment 
of a polypeptide of amino acid sequence SEQ ID No. 5 or SEQ ID No. 6. 

30 A polypeptide having at least 95% amino acid identity with a 

reference polypeptide comprises at least 96%, 97%, 98%, 99%, 99.1%, 
99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8% or 99.9% amino 
acid identity with the reference polypeptide. 

The "variant" polypeptides are encompassed in the definition of 

35 a polypeptide having at least 95% amino acid identity with a reference 
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polypeptide according to the invention. The "variant" of a polypeptide 
according to the invention is intended to mean a polypeptide the amino 
acid sequence of which comprises one or more substitutions, additions 
or deletions of at least one amino acid residue, compared to the amino 
5 acid sequence of the reference polypeptide, it being understood that the 
amino acid substitutions may be conservative or nonconservative in 
nature. 

A variant of a reference polypeptide according to the invention 
consists of a polypeptide which conserves the biological function or the 

10 biological activity of the reference polypeptide and/or which is recognized 
by antibodies directed against the reference polypeptide. These 
polypeptide variants may result from allelic variations characterized by 
differences in the nucleotide sequences of the gene encoding these 
polypeptides. Such polypeptide variants may also result from alternative 

15 splicing or from post-translation modifications. 

The term "fragment" of a reference polypeptide according to the 
invention is intended to mean a polypeptide having at least 10, 15, 20, 
25, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 135 or 140 
consecutive amino acids of a polypeptide as defined in the present 

20 description. 

PROBES AND PRIMERS ACCORDING TO THE INVENTION 

The nucleic acids according to the invention, and in particular 
25 the nucleotide sequences SEQ ID No. 1 to SEQ ID No. 4, their 
fragments of at least 12 nucleotides, the sequences having at least 95% 
nucleotide identity with at least part of the sequences SEQ ID No. 1 to 
SEQ ID No. 4, and also the nucleic acids of complementary sequence, 
are useful for detecting the presence of at least one copy of a nucleotide 
30 sequence of the lsum2A gene or else of a fragment or else of an allelic 
variant of this sequence, in a sample. 

Also part of the invention are the nucleotide probes and primers 
which hybridize, under high stringency hybridization conditions, with a 
nucleic acid chosen from the sequences SEQ ID No. 1 to SEQ ID No. 4. 
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The hybridization conditions below are used for the 
hybridization of a nucleic acid, probe or primer, 20 bases in length. 

The level and the specificity of hybridization depend on various 
parameters, such as: 
5 a) the purity of the preparation of the nucleic acid to which the 

probe or the primer must hybridize; 

b) the base composition of the probe or of the primer, G-C base 
pairs having greater thermal stability than A-T or A-U base pairs; 

c) the length of the sequence of bases which are homologous 
10 between the probe or the primer and the nucleic acid; 

d) the ionic strength: the hybridization rate increases with an 
increase in ionic strength and the duration of the incubation time; 

e) the incubation temperature; 

f) the concentration of the nucleic acid to which the probe or the 
15 primer must hybridize; 

g) the presence of denaturing agents such as agents which 
promote cleavage of hydrogen bonds, for instance formamide or urea, 
which increase the stringency of the hybridization; 

h) the incubation time, the hybridization rate increasing with the 
20 duration of incubation; 

i) the presence of volume excluders, such as dextran or dextran 
sulfate, which increase the rate of hybridization due to the fact that they 
increase the effective concentrations of the probe or of the primer and of 
the nucleic acid which must hybridize, within the preparation. 

25 The parameters defining the conditions of stringency depend on 

the temperature at which 50% of the paired strands separate (Tm). 

For sequences comprising more than 360 bases, Tm is defined 
by the relationship: 

Tm = 81.5 + 0.41 (% G+C)+16.6 Log(cation concentration) - 
30 0.63 (% formamide)-(600/number of bases) (SAMBROOK et al., (1989), 
pages 9.54-9.62). 

For sequences less than 30 bases in length, Tm is defined by 
the relationship: Tm = 4(G+C) + 2(A+T). 
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Under suitable conditions of stringency, under which aspecific 
sequences do not hybridize, the hybridization temperature is 
approxiamtely from 5 to 30°C, preferably from 5 to 10°C, below Tm. 

The expression "high stringency hybridization conditions" 
5 according to the invention is intended to mean hybridization conditions 
such that the procedure is carried out at a hybridization temperature of 
5°C below the Tm. 

The hybridization conditions described above can be adjusted 
as a function of the length and of the base composition of the nucleic 
10 acid for which hybridization is sought or of the type of labeling chosen, 
according to techniques known to those skilled in the art. 

Suitable hybridization conditions may, for example, be adjusted 
according to the teaching contained in the work by HAMES and 
HIGGINS (1985) or else in the work by AUSUBEL et al. (1989). 
15 By way of illustration, the hybridization conditions used for a 

nucleic acid 200 bases in length are as follows: 
Prehvbridization: 

same conditions as for the hybridization 

duration: overnight. 

20 

Hybridization: 

5 x SSPE (0.9 M NaCI, 50 mM sodium phosphate, pH 7.7, 
5 mM EDTA) 

5 x Denhardfs (0.2% PVP, 0.2% Ficoll, 0.2% BSA) 
25 100 pg/ml salmon sperm DNA 

0.1% SDS 
duration: overnight. 

Washes : 

2 x SSC, 0.1% SDS 10 min 65°C 
30 1 xSSC, 0.1%SDS10min65°C 

0.5 x SSC, 0.1% SDS 10 min 65°C 
0.1 x SSC, 0.1% SDS 10 min 65°C. 

The nucleotide probes or primers according to the invention 
35 comprise at least 12 consecutive nucleotides of a nucleic acid according 
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to the invention, in particular of a nucleic acid of sequences SEQ ID Nos. 
1 to 4 or of the sequence complementary thereto, of a nucleic acid 
having 95% nucleotide identity with a sequence chosen from the 
sequences SEQ ID Nos. 1 to 4 or of the sequence complementary 
5 thereto, or else of a nucleic acid which hybridizes, under high stringency 
hybridization conditions, with a sequence chosen from the sequences 
SEQ ID Nos. 1 to 4 or of the sequence complementary thereto. 

Preferably, nucleotide probes or primers according to the 
invention will have a length of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 

10 50, 60, 100, 150, 200, 300, 400, 500, 1000, 2000 or 3000 consecutive 
nucleotides of a nucleic acid according to the invention. 

Alternatively, a nucleotide probe or primer according to the 
invention will consist of and/or will comprise the fragments with a length 
of 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 60, 100, 150, 200, 300, 400, 

15 500, 1000, 2000 or 3000 consecutive nucleotides of a nucleic acid 
according to the invention. 

Examples of primers or pairs of primers for amplifying a nucleic 
acid fragment of the lsum2A gene of sequence SEQ ID No. 1 or SEQ ID 
No. 2 are, for example, the primers SEQ ID Nos. 10 to 13 and 17 to 23. 

20 The use of the primers of sequences SEQ ID Nos. 10 to 13 and 

17 to 23 in amplification reactions is described in the examples. 

A nucleotide primer or probe according to the invention can be 
prepared by any suitable method well known to those skilled in the art, 
including by cloning and the action of restriction enzymes, or else by 

25 direct chemical synthesis according to techniques such as the 
phosphodiester method of NARANG et al. (1979) or of BROWN et al. 
(1979), the diethylphosphroamidite method of BEAUCAGE et al. (1980) 
or else the solid support technique described in European patent No. 
EP 0 707 592. Each of the nucleic acids according to the invention, 

30 including the oligonucleotide probes and primers described above, can 
be labeled, if desired, by incorporating a molecule which is detectable, 
i.e. a label which is detectable, by spectroscopic, photochemical, 
biochemical, immunochemical or else chemical means. 
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For example, such labels can consist of radioactive isotopes 
( 32 P 3 H, 35 S), fluorescent molecules (5-bromodeoxyuridine, fluorescein, 
acetylaminofluorene) or else ligands such as biotin. 

The labeling of the probes is preferably carried out by 
5 incorporation of labeled molecules into the polynucleotides by primary 
extension, or else by addition to the 5' or 3' ends. 

Examples of nonradioactive labeling of nucleic acid fragments 
are described in particular in French patent No. FR 78 10 975 or else in 
the articles by URDEA et al. (1988) or SANCHEZ PESCADOR et al. 
10 (1988). 

Advantageously, the probes according to the invention may 
have structural characteristics such that they allow amplification of the 
signal, such as the probes described by URDEA et al. (1991) or else 
European patent No. EP 0 225 807 (Chiron). 
15 The oligonucleotide probes according to the invention can be 

used in particular in Southern-type hybridizations to the genomic DNA of 
the lsum2A gene or else in hybridizations to the messenger RNA of this 
gene when the expression of the corresponding transcript is sought in a 
sample. 

20 The probes according to the invention can also be used for 

detecting PCR amplification products or else for detecting mismatches. 

Nucleotide probes or primers according to the invention can be 
immobilized on a solid support. Such solid supports are well known to 
those skilled in the art and comprise surfaces of microtitration plate wells, 

25 polystyrene beads, magnetic beads, nitrocellulose strips or else 
microparticles such as latex particles. 

Consequently, a subject of the invention is also a nucleic acid 
which can be used as a nucleotide probe or primer, characterized in that 
it comprises at least 12 consecutive nucleotides of a nucleic acid as 

30 defined above, in particular of a nucleic acid of nucleotide sequences 
SEQ ID No. 1 to SEQ ID No. 4. 

The invention also relates to a nucleic acid which can be used 
as a nucleotide probe or primer, characterized in that it consists of a 
polynucleotide of at least 12 consecutive nucleotides of a nucleic acid 

35 according to the invention, most preferably of a nucleic acid of 
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sequences chosen from the nucleotide sequences SEQ ID No. 1 to SEQ 
ID No. 4. 

As described above, such a nucleic acid may also be 
characterized in that it is labeled with a detectable molecule. 
5 A nucleic acid which can be used as a nucleotide probe or 

primer for detecting or amplifying a genomic, mRNA or cDNA sequence 
of the lsum2A gene can also be characterized in that it is chosen from 
the following sequences: 

a) the nucleotide sequences which hybridize, under high 
10 stringency hybridization conditions, with a nucleic acid of sequence SEQ 

ID No. 1 orSEQIDNo. 2; and 

b) the sequences comprising at least 12 consecutive 
nucleotides of a nucleic acid of sequence SEQ ID No. 1 or SEQ ID No. 
2. 

15 The present invention also relates to a method for detecting the 

presence of a nucleic acid of the lsum2A gene in a sample, said method 
comprising the steps of: 

1 ) bringing a nucleotide probe or a plurality of nucleotide probes 
according to the invention into contact with the test sample; 
20 2) detecting the complex possibly formed between the probe(s) 

and the nucleic acid present in the sample. 

According to a particular embodiment of the method of 
detection according to the invention, the oligonucleotide probe(s) is (are) 
immobilized on a support. 
25 According to another aspect, the oligonucleotide probes 

comprise a detectable label. 

The invention also relates to a pack or kit for detecting the 
presence of a nucleic acid according to the invention in a sample, said 
pack comprising: 
30 a) one or more nucleotide probes as defined above; 

b) where appropriate, the reagents required for the hybridization 

reaction. 

According to a first aspect, the detection pack or kit is 
characterized in that the probe(s) is (are) immobilized on a support. 
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According to a second aspect, the detection pack or kit is 
characterized in that the oligonucleotide probes comprise a detectable 
label. 

According to a particular embodiment of the detection kit 
5 described above, such a kit will comprise a plurality of oligonucleotide 
probes in accordance with the invention, which will be used for detecting 
target sequences of interest of the lsum2A gene, or alternatively 
detecting mutations in the coding regions or the noncoding regions of the 
lsum2A gene, more particularly nucleic acids of sequence SEQ ID No. 1 
10 to SEQ ID No. 4 or the nucleic acids of complementary sequence. 

For the purpose of the invention, the term "target sequence" is 
intended to mean a nucleotide sequence comprised in a nucleic acid, 
said nucleotide sequence hybridizing, under the hybridization conditions 
specified in the description, with a nucleotide probe or primer of the 
15 invention. 

A target sequence may be, for example, a sequence comprised 
in a regulatory nucleic acid for the lsum2A gene or else a sequence 
comprised in a genomic coding region or a coding region of the cDNA of 
this gene. 

20 The nucleotide primers according to the invention may be used 

for amplifying any nucleotide fragment (gDNA, cDNA, mRNA) of the 
lsum2A gene, and more particularly all or part of a nucleic acid of 
sequence SEQ ID No. 1 to SEQ ID No. 4, or else a fragment or a variant 
of these sequences. 

25 Another subject of the invention concerns a method for 

amplifying a nucleic acid according to the invention, and more 
particularly a nucleic acid of sequence SEQ ID No. 1 to SEQ ID No. 4 or 
a fragment or an allelic variant thereof, contained in a sample, said 
method comprising the steps of : 

30 a) bringing the sample in which the presence of the target 

nucleic acid is suspected into contact with a pair of nucleotide primers 
the hybridization position of which is located, respectively, on the 5' side 
and on the 3' side of the region of the target nucleic acid of the lsum2A 
gene whose amplification is desired, in the presence of the reagents 

35 required for the amplification reaction; and 
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b) detecting the nucleic acid possibly amplified. 
To implement the method of amplification as defined above, 
use will advantageously be made of any one of the nucleotide primers 
described hereinafter. 
5 A subject of the invention is also a pack or kit for amplifying a 

nucleic acid according to the invention, and more particularly all or part of 
a nucleic acid of sequences SEQ ID Nos. 1 to 4, said pack or kit 
comprising: 

a) a pair of nucleotide primers in accordance with the invention, 
10 the hybridization position of which is located, respectively, on the 5' side 

and on the 3* side of the target nucleic acid of the lsum2A gene whose 
amplification is desired; 

b) where appropriate, the reagents required for the amplification 

reaction. 

15 Such an amplification pack or kit will advantageously comprise 

at least one pair of nucleotide primers as described above. 

According to a preferred embodiment, primers according to the 
invention comprise all or part of a polynucleotide chosen from the 
nucleotide sequences SEQ ID Nos. 10 to 13 and 17 to 23. 

20 NUCLEIC ACID CONSTRUCTS ACCORDING TO THE INVENTION 

The isolation and characterization of the isum2A gene 
according to the invention and the demonstration of the need for 
expression of the isum2A gene at a detectable level in the plant in order 
to obtain normal development of the embryo of the grain, after 

25 pollenation, have enabled the inventors to prepare nucleic acid 
constructs which allow the expression of at least one functional copy of 
the isum2A gene in a cellular host, particularly in a plant cell transformed 
with such nucleic acid constructs. 

Preferably, said cell carries at least one nonfunctional isum2A 

30 allele. Entirely preferably, said cell carries the two nonfunctional isum2A 
alleles. 

According to the invention, nucleic acid constructs have been 
developed which make it possible to obtain controlled expression of at 
least one functional copy of a nucleic acid encoding an ISUM2A 
35 polypeptide in a cellular host, particularly in a plant cell. 
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Consequently, a subject of the invention is also a nucleic acid 
comprising a polynucleotide encoding an ISUM2A polypeptide as defined 
in the present description, or encoding a fragment of this polypeptide, 
said nucleic acid also comprising a regulatory polynucleotide capable of 
5 regulating the transcription or the translation of the polynucleotide 
encoding the ISUM2A polypeptide, or the fragment of this polypeptide. 

Also part of the invention is the nucleic acid of sequence 
complementary to the nucleic acid as defined above. 

A nucleic acid construct comprising a polynucleotide encoding 
10 an ISUM2A polypeptide and also a regulatory polynucleotide is also 
referred to as "expression cassette" in the present description. In 
general, an expression cassette according to the invention will comprise 
at least one polynucleotide encoding an ISUM2A polypeptide, it being 
possible for the other functional elements which allow expression of the 
15 ISUM2A polypeptide to be carried by a vector into which the expression 
cassette can be inserted. 

An expression cassette according to the invention may also 
contain, in a nonlimiting manner, besides a regulatory polynucleotide, 
other functional elements such as leader sequences or a terminator 
20 sequence or else transcription initiation and stop sequences. 

According to another embodiment of a nucleic acid construct of 
the invention, use is made of promoters known to direct the expression 
of the fused nucleic acid sequence in a constitutive or tissue-specific 
(grain) or developmental stage-specific manner. 
25 By way of example, mention may be made of: 
a) constitutive promoters: 

- the 35S promoter of the cauliflower mosaic virus, or the 19S 
promoter, or advantageously the double 35S constitutive 
promoter (pd35S), described in the article by Kay et al., 1987; 

30 - the rice actin promoter followed by the rice actin intron (pAR- 

IAR) contained in the plasmid pAct1-F4 described by Mc Elroy 
et al., 1991; 

- the EF-1a constitutive promoter of the gene encoding plant 
elongation factor described in PCT application No. 

35 WO 90/021 72 or else in the article by AXELOS et al. (1 989); 
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- the chimeric super-promoter PSP (Nl et al., 1995) consisting of 
the fusion of three copies of the transcriptional activity element 
of the promoter of the octopine synthase gene from 
Agrobacterium tumefaciens and of the transcription-activating 

5 element of the promoter of the mannopine synthase gene from 

Agrobacterium tumefaciens] and 

- the sunflower ubiquitin promoter (BINET et al., 1991); 

- the maize ubiquitin 1 promoter (CHRISTENSEN et al., 1996); 

- the maize ubiquitin 1 promoter (Christensen et al., 1996). 
10 b) specific promoters: 

- grain-specific promoters (R. Datla et al., 1997), in particular the napin 
promoter (EP 255 378), the phaseolin promoter (Riggs et al., 1989), 
the glutenin promoter, the helianthinin promoter (WO 92/17580), the 
albumin promoter (WO 98/45460), the oleosin promoter 

15 (WO 98/45461), the ATS1 promoter or the ATS3 promoter 
(WO 99/20775); 

- Esr promoters as described in application PCT/FR00/02596, which 
allow both specific expression at the interface between the embryo 
and the albumen and early expression during development of the 

20 albumen; 

- the promoter of the Vp1 gene described by Mc Carty et al. (1989). 
The regulatory nucleic acids above can be used to overexpress the 

polynucleotide encoding the ISUM2A polypeptide or else a fragment of 
the ISUM2A polypeptide, in particular when it is desired to obtain plants 

25 having grains with large embryos rich in oil. 

Thus, the invention also relates to a nucleic acid encoding an 
ISUM2A polypeptide or encoding a fragment of an ISUM2A polypeptide 
as defined above, and comprising a regulatory polynucleotide which 
regulates the transcription and/or the translation of the coding sequence, 

30 in which the regulatory polynucleotide allows a high level of transcription 
of the corresponding mRNA and/or a high level of translation of the 
corresponding polypeptide in the host organism, including host cell or 
plant, in which it is expressed. 

The invention also relates to the use of a nucleic acid as defined 

35 above, of a recombinant vector comprising this nucleic acid or else of a 



30 



host cell transfected or transformed with this nucleic acid, for obtaining a 
transformed plant capable of producing grains rich in oil. 

In a preferred embodiment, controlled expression of the 
polynucleotide encoding the ISUM2A polypeptide is sought, most 
5 particularly when the production of grains with altered germ development 
is desired. 

As indicated above, a deficiency in the expression of the 
isum2A gene causes the production of germ-free grains, these grains 
being infertile and not allowing multiplication of the plants mutated in the 
10 isum2A gene. 

In addition, expression of the isum2A gene appears to be 
necessary for normal growth of the plant before pollination. 

Pollination is the moment at which the mature pollen comes into 
contact with a receptive bristle. 
15 It results therefrom that the obtaining of a production of grains 

with altered germ development by a plant involves: 

a) normal expression of at least one functional copy of a nucleic 
acid encoding an ISUM2A polypeptide from the grain germination stage 
until pollination of the plant; and 
20 b) an absence of transcription or of translation of this nucleic 

acid once pollination of the plant has occurred, in order to produce grains 
altered in germ development. 

The absence of transcription or of translation of the nucleic acid 
encoding an ISUM2A polypeptide from the stage of pollination alters or 
25 blocks the development of the grain embryo in order to achieve the 
desired aim. 

When reproduction and multiplication of the plants of interest is 
sought, and when the production of normal and fertile grains is desired, 
normal expression of the ISUM2A polypeptide will be pursued throughout 
30 the development of the plant, including after pollination. 

To pursue the aim of the invention, it is therefore particularly 
advantageous to use nucleic acid constructs in which the polynucleotide 
encoding an ISUM2A polypeptide is placed under the control of a 
regulatory polynucleotide, the activity of which can be controlled over 
35 time. 
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The invention therefore also relates to a nucleic acid comprising 
a polynucleotide encoding an ISUM2A polypeptide, chosen from the 
sequences having at least 95% amino acid identity with the sequences 
SEQ ID No. 5 and SEQ ID No. 6, and which also comprises a regulatory 
5 polynucleotide sensitive to the direct or indirect action of an inducing 
signal, also referred to as inducible regulatory polynucleotide. 

According to a first aspect, the regulatory polynucleotide is a 
transcription- or translation-"repressing" polynucleotide. 

According to the invention, the expression "repressor" 
10 regulatory polynucleotide is intended to mean a regulatory sequence the 
constitutive activity of which can be blocked by an external signal. Such 
an external signal may be the absence of binding of a transcription factor 
recognized by the repressor regulatory polynucleotide. The absence of 
binding of the transcription factor may be induced under the effect of the 
15 repressor inducer signal to which the repressor regulatory polynucleotide 
is sensitive. 

According to this first particular embodiment, the expression of 
the sequence encoding an ISUM2A polypeptide is constitutive in the 
chosen cellular host, in the absence of the repressor inducer signal to 
20 which the repressor regulatory polynucleotide is directly or indirectly 
sensitive. 

Bringing the cellular host into contact with the repressor inducer 
signal has the effect, by virtue of a direct or indirect action on the 
repressor regulatory polynucleotide, of inhibiting and/or blocking the 
25 expression of the polynucleotide encoding the ISUM2A polypeptide. 

In order to produce the DNA constructs according to the 
invention comprising a repressor regulatory polynucleotide, those skilled 
in the art will make use of their technical general knowledge in the field of 
gene expression in plants. 
30 According to a second embodiment of a nucleic acid construct 

of the invention, the regulatory polynucleotide is a transcription- or 
translation-activating polynucleotide. 

Entirely preferably, the transcription- or translation-activating 
regulatory polynucleotide is directly or indirectly sensitive to the action of 
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an activator inducer signal. This is then an "inducible activator" 
polynucleotide for the purpose of the invention. 

According to the invention, a regulatory polynucleotide of the 
"inducible activator" type is a regulatory sequence which is only activated 
5 in the presence of an external signal. Such an external signal may be the 
binding of a transcription factor, it being possible for the binding of a 
transcription factor to be induced under the effect of the activator inducer 
signal to which the regulatory polynucleotide is directly or indirectly 
sensitive. 

10 When such a nucleic acid construct is used in a cellular host, 

expression of the polynucleotide encoding an ISUM2A polypeptide 
according to the invention may be induced by bringing the transformed 
cellular host into contact with the activator inducer signal to which the 
activator regulatory polynucleotide is directly or indirectly sensitive. 

15 When an absence of expression of the polynucleotide encoding 

an ISUM2A polypeptide in this transformed cellular host is sought, it is 
then sufficient to eliminate or suppress the presence of the activator 
inducer signal to which the transcription- or translation-regulating 
polynucleotide is sensitive. 

20 Those skilled in the art will make use of their technical general 

knowledge in the field of regulatory polynucleotides, in particular those 
which are active in plants, to define the constructs corresponding to the 
definition of the second embodiment above. 

In the interests of complete clarity in the explanation of the 

25 characteristics of the nucleotide constructs which are preferably used to 
obtain a production of germ-free grains, several inducible and 
controllable systems for the expression of an ISUM2A polypeptide in 
plants are defined below. The general characteristics of suitable 
expression systems are first of all summarized in Table 3 below, and 

30 their functional aspects are then given in detail. 
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Table 3 : 

Examples of inducible expression systems for ISUM2A. 







Induction system 


Expression system 




background of the 
cellular host or of 
the plant 


Promoter 


Gene 


Promoter 


Gene 


I 
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Activator 
regulat- 
able by 
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Under the 
control of 
the 


lsum2A 


II 

II 
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Constitutive 
or embryo- 
specific 


Repressor 
regulat- 
able by 
inducer 


Under the 
control of 
the 

repressor 


lsum2A 


III 


lsum2A*/lsum2A* a 


Constitutive 
or embryo- 
specific 


Activator 
regulat- 
able by 
inducer 


Under the 
control of 
the 

activator 


lsum2A 
antisense 
poly- 
nucleotide 



a isum 2A7isum2A": homozygous for a nonfunctional copy of the isum2A 
gene, for example a mutation. 
5 b lsum 2A + /lsum2A + : homozygous carrying two functional copies of the 
isum2A gene. 

Description of the inducible expression system I from Table 3 

The inducible expression system I from Table 3 constitutes an 
10 illustration of a nucleic acid construct in which the regulatory 
polynucleotide is an "inducible" transcription- and/or translation- 
activating" polynucleotide according to the second embodiment defined 
above. 

The expression system I comprises: 
15 (i) a first expression cassette comprising a nucleic acid 

encoding an ISUM2A polypeptide placed under the control of a 
promoter, the activity of which is induced only in the presence of an 
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activator compound, when this activator compound, in general a 
transcription factor, is bound to an inducer compound; 

(ii) a second expression cassette comprising a nucleic acid 
encoding the activator compound, for example the transcription factor 
5 above, placed under the control of a promoter which allows constitutive 
expression of the activator compound. 

According to the system I, the transcription or the translation of 
the sequence encoding an ISUM2A polypeptide is induced only when the 
activator compound is complexed with the inducer compound. The 
10 complex between the activator compound and the inducer compound 
binds to the promoter controlling the expression of the nucleic acid 
encoding an ISUM2A polypeptide, and activates the expression of the 
latter. 

An example of the system I is illustrated in Example 3, in which 
15 the system comprises: 

(i) a nucleic acid encoding an ISUM2A polypeptide placed 
under the control of a promoter containing the UAS sequence, which is 
recognized by the GVG activator compound. The GVG activator 
compound is a protein from fusion between the DNA-binding domain of 

20 the GAL4 protein, the VP16 gene activing domain and the rat 
glucocorticoid receptor (GR); 

(ii) a nucleic acid encoding the GVG fusion protein defined 
above, which is placed under the control of a constitutive promoter. 

In Example 3, the GVG fusion protein is expressed 
25 constitutively in the plant, but does not activate the promoter controlling 
the expression of the ISUM2A polypeptide in the absence of 
glucocorticoid. 

On the other hand, when the plant is brought into contact with 
the inducer compound represented by a glucocorticoid, the 

30 glucocorticoid binds to the GVG fusion protein and the complex thus 
formed between the activator inducer compound (glucocorticoid) and the 
activator compound (the GVG fusion protein) will bind to the UAS 
sequence of the promoter controlling the expression of an ISUM2A 
polypeptide, which activates the promoter containing the UAS sequence 

35 and induces the synthesis of the ISUM2A polypeptide. 
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In the inducible expression system above, the glucocorticoid 
constitutes the inducer compound. 

The activator inducer signal is the binding of the activator 
compound/inducer compound (GVG fusion polypeptide/glucocorticoid) 
5 complex to the activator regulatory polynucleotide (promoter containing 
the UAS sequence). 

The expression system I will preferably be used in a plant 
cellular host or in a plant in which the two copies of the isum2A gene are 
nonfunctional, for example in which the two copies of the isum2A gene 
10 are mutated. 

Inducible expression system II from Table 3 

The inducible expression system II is an illustration of the 

embodiment of a nucleic acid construct according to the invention, 
15 according to the first embodiment set out above, in which the 

polynucleotide encoding an ISUM2A polypeptide is placed under the 

control of a "repressor" regulatory polynucleotide. 

In the system II according to Table 3, the expression system 

contains two expression cassettes, respectively: 
20 (i) a first expression cassette comprising a nucleic acid 

encoding an ISUM2A polypeptide, placed under the control of a 

regulatory region, the activity of which is constitutive because it is 

induced in the presence of an activator compound produced 

constitutively; 

25 (ii) a second expression cassette comprising a nucleic acid 

encoding the activator compound which is active on the regulatory region 
of the expression cassette (i) above, said nucleic acid encoding the 
activator compound being placed under the control of a constitutive 
promoter, preferably a strong constitutive promoter, or else a promoter 

30 which is active specifically in the cells of the embryo of the grain. 

In the system II, the activator compound loses its function of 
activation of the regulatory region controlling the expression of the 
nucleic acid encoding an ISUM2A polypeptide, when this activator 
compound is brought into contact with a specific ligand, which is here 

35 referred to as repressor inducer compound. 
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Thus, in the absence of the specific ligand constituting the 
repressor inducer compound, the activator compound is expressed 
constitutively and activates the expression of an ISUM2A polypeptide. 

On the other hand, when the activator compound is brought into 
5 contact with the repressor inducer compound with which it binds, the 
activator compound is deactivated and no longer binds to the regulatory 
region controlling the expression of the nucleic acid encoding an 
ISUM2A polypeptide. In this situation, the ISUM2A polypeptide is no 
longer expressed. 

10 Example 2 illustrates a particular embodiment of a system II as 

defined above. The system II presented in Example 2 comprises, 
respectively: 

(i) a first expression cassette comprising a nucleic acid 
encoding the activator compound tTA, placed under the control of a 

is strong constitutive promoter, the promoter of the rice actin 1 gene; and 

(ii) a second expression cassette comprising a nucleic acid 
encoding an ISUM2A polypeptide, placed under the control of a 
promoter containing a TRE unit, the promoter being activated when the 
tTA activator binds to the TRE element. 

20 Thus, in the absence of any repressor inducer compound, the 

activator compound tTA is produced constitutively and activates the 
promoter controlling the sequence encoding an ISUM2A polypeptide. 

On the other hand, in the presence of the repressor inducer 
compound, which, in this case, is tetracycline or a derivative of 

25 tetracycline, the tTA activator is deactivated and no longer binds to the 
TRE unit of the promoter controlling the nucleic acid encoding an 
ISUM2A polypeptide. In this situation, in the presence of tetracycline, the 
ISUM2A polypeptide is no longer expressed. 

The repressor inducer signal is the absence of binding of the 

30 activator compound/repressor inducer compound (tTA 
activator/tetracycline) complex to the repressor regulatory polynucleotide 
(promoter containing the TRE unit). 

Preferably, an expression system II as defined above will be 
used in a plant cellular host or in a plant for which the two copies of the 

35 isum2A gene are mutated or inactivated. 
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Expression system III 

The expression system III, which is summarized in Table 3, is 
very similar, with regard to the elements regulating the expression of the 
5 expression cassettes which it contains, to the expression system I 
above. 

With the expression system III, it is possible to control the 
expression of an antisense polynucleotide capable of inhibiting the 
production of an ISUM2A polypeptide in a plant comprising at least one 
10 functional copy of the isum2A gene, preferably in a plant comprising the 
two functional copies of the isum2A gene. 

The expression system III comprises two expression cassettes, 
respectively: 

(i) an expression cassette containing a nucleic acid encoding an 
is antisense polynucleotide, or RNAi, which inhibits the translation of the 

ISUM2A polypeptide, this nucleic acid being placed under the control of 
a promoter which is activated by a given activator compound, when this 
activator compound is bound to an inducer compound; and 

(ii) an expression cassette comprising a nucleic acid encoding 
20 the activator compound, this nucleic acid being placed under the control 

of a constitutive or embryo-specific promoter, preferably a strong 
promoter. 

According to the expression system III, in the absence of 
binding of the activator compound to the inducer compound, the 
25 promoter controlling the expression of the antisense polynucleotide, or 
RNAi, is inactive and the antisense polynucleotide is not produced. 

On the other hand, in the presence of the inducer compound 
capable of activating the activator compound, for example by forming a 
complex with it, the activated activator compound binds to the promoter 
30 controlling the expression of the isum2A antisense polynucleotide, which 
activates the expression of the antisense polynucleotide, leading to 
inhibition of the translation of the isum2A gene. 

By virtue of the expression system III, the synthesis of the 
ISUM2A polypeptide is therefore inhibited in the presence of the inducer 
35 compound which activates the activator compound. 
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Such an expression system makes it possible to precisely 
control the moment at which it is desired to block the synthesis of the 
ISUM2A polypeptide, by bringing the deactivated activator compound 
into contact with the inducer compound which activates the activator 
5 compound, for example from the beginning of pollination. 

An illustrative example of an expression system III may be 
directly derived from that described in Example 3, when the nucleic acid 
encoding an ISUM2A polypeptide is substituted with a nucleic acid 
encoding an antisense polynucleotide which inhibits the translation of the 
10 ISUM2A polypeptide or any other method of inhibition of endogenous 
genes using a transgene and known to those skilled in the art 
(cosuppression, ribozyme, double-stranded RNA, etc). 

The inducible systems for the expression of an ISUM2A 
polypeptide comprise several expression cassettes which may 
15 indifferently be included in a single expression vector or, on the other 
hand, in different expression vectors. 

An inducible expression system such as the systems I, II and III 
described above advantageously comprises at least one selection 
marker gene, such as, for example, a gene for resistance to the 
20 herbicide BASTA, well known to those skilled in the art. 

According to a first embodiment, the selection marker gene is 
carried by the vector comprising the expression cassette(s) constituting 
the inducible expression system. 

According to a second embodiment, the selection marker gene 
25 is carried by a vector other than a vector comprising the expression 
cassette(s) constituting the inducible expression system. 

To prepare a nucleic acid construct of the invention, use will be 
made of an inducible activator regulatory polynucleotide chosen from 
those described below. 

30 

Preferred inducible regulatory polynucleotides according to the 
invention 

The regulatory sequence capable of controlling the nucleic acid 
35 encoding an ISUM2A polypeptide according to the invention may be a 



39 



regulatory sequence which can be induced by a particular metabolite, 
such as: 

- a glucocorticoid-inducible regulatory sequence, as described 
by AOYAMA et al. (1997) or as described by McNELLYS et al. (1998); 

- an ethanol-inducible regulatory sequence, such as that 
described by SALTER et al. (1998) or else as described by CADDICK et 
al. (1998); 

- a tetracycline-inducible regulatory sequence, such as that sold 
by the company CLONTECH; 

- a promoter sequence which can be induced by a pathogen or 
by a metabolite produced by a pathogen; 

- a PR-type gene regulatory sequence, which can be induced by 
salicylic acid or BTH or Aliette (Gorlach et al., 1996, Molina et al., 1998); 

- a regulatory sequence of the Ecdysone receptor type 
(Martinez et al., 1999) which can be induced by tebufenozide (product 
reference RH5992, sold by ROHM & HAAS) for example, belonging to 
the di benzoyl hydrazine family. 
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Other sequences containing regulatory signals 

Advantageously, the nucleic acid allowing synthesis of the 
ISUM2A polypeptide may also contain one or more other sequences 
5 containing regulatory signals for the expression of the region encoding 
ISUM2A, or alternatively may be placed under the control of such 
regulatory sequences. 

"Leader" sequences 

10 

The other sequences containing regulatory signals encompass 5' 
untranslated sequences referred to as "leader" sequences. Such 
sequences may increase the translation of the mRNA encoding ISUM2A. 
Among said sequences, mention may be made, by way of nonlimiting 
15 illustration, of: 

- the EMCV leader (EncephaloMyoCarditis VIRUS 5' noncoding 
region) (ELROY-STEIN et al., 1989); 

- the TEV (Tobacco Etch Virus) leader (CARRINGTON AND 
FREED, 1990); 

20 - the leader of the BiP gene encoding the human 

immunoglobulin heavy chain-binding protein (MACEJACK et al., 1991); 

- the AMV RNA 4 leader from the mRNA of the alfalfa mosaic 
virus protein (JOBLING et al., 1987); 

- the tobacco mosaic virus leader (GALLIE et al., 1989). 

25 

"Terminator" sequences 

The nucleic acid allowing synthesis of the ISUM2A polypeptide, 
or the vector into which this nucleic acid is inserted, may also comprise 
30 "terminator" sequences. 

Among the terminators which can be used in the constructs of 
the invention, mention may be made, by way of nonlimiting illustration, 
of: 

- the 35S polyA of the cauliflower mosaic virus (CaMV), 
35 described in the article by FRANCK et al. (1980); 
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- the nos terminator corresponding to the 3' noncoding region of 
the nopaline synthase gene of the Agrobacterium tumefaciens Ti 
plasmid (DEPICKER et al., 1992); 

- the histone gene terminator (EP 0 633 317). 

5 According to another alternative, the expression cassette 

according to the invention may comprise a polynucleotide encoding an 
ISUM2A polypeptide fused to a gene regulatory sequence of the 
glucocorticoid receptor GR fragment type (Aoyma et al. 1997), said 
polynucleotide being placed under the control of a promoter sequence of 

10 the native ISUM2 promoter or constitutive promoter type. In the presence 
of the hormone, the resulting hybrid protein is no longer retained in the 
cytoplasm and can therefore enter into the chloroplast and integrate into 
the ribosome. 

15 RECOMBINANT VECTORS OF THE INVENTION 

A nucleic acid which allows the synthesis of the ISUM2A 
polypeptide can be inserted into a suitable vector. 

For the purpose of the present invention, the term "vector" will 
be intended to mean a circular or linear DNA or RNA molecule which is 
20 indifferently in single-stranded or double-stranded form. 

A recombinant vector according to the invention is preferably an 
expression vector, or more specifically an insertion vector, a 
transformation vector or an integration vector. 

It may in particular be a vector of bacterial or viral origin. 
25 In all cases, the nucleic acid which allows the synthesis of the 

ISUM2A polypeptide is placed under the control of one or more 
sequences containing signals regulating its expression in the plant under 
consideration, whether the regulatory signals are all contained in the 
nucleic acid encoding ISUM2A, as is the case in the nucleic acid 
30 constructs described in the preceding section, or whether one or more of 
them, or even all the regulatory signals, are contained in the recipient 
vector into which the nucleic acid encoding ISUM2A has been inserted. 

A recombinant vector according to the invention 
advantageously comprises suitable transcription initiation and stop 
35 sequences. 
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In addition, the recombinant vectors according to the invention 
may include one or more origins of replication which are functional in the 
host cells in which their expression is desired, and also, where 
appropriate, selection marker nucleotide sequences. 
5 The recombinant vectors according to the invention may also 

include one or more expression-regulating signals as defined above in 
the description. 

The bacterial vectors which are preferred according to the 
invention are, for example, the pBR322 (ATCC No. 37 017) vectors or 
10 else the vectors such as pAA223-3 (Pharmacia, Uppsala, Sweden) and 
pGEM1 (Promega Biotech, Madison, Wl, United States). 

Mention may also be made of other vectors which are 
commercially available, such as the vectors pQE70, pQE60, pQE9 
(Quiagen), psiX174, pBluescript SA, pNH8A, pMH16A, pMH18A, 
is pMH46A, pWLNEO, pSV2CAT, pOG44, pXTI and pSG (Stratagene). 

They may also be vectors of the Baculovirus type, such as the 
vector pVL1 392/1 393 (Pharmingen) used for transfecting cells of the Sf9 
line (ATCC No. CRL 1711) derived from Spodoptera frugiperda. 

Preferably, and for the main application of the vectors of the 
20 invention consisting in obtaining stable and preferably inducible 
expression of a sequence encoding an ISUM2A polypeptide in a plant, 
use will be made of vectors which are especially suitable for the 
expression of sequences of interest in plant cells, such as the following 
vectors: 

25 - vector pBIN19 (BEVAN et al.), sold by the company 

CLONTECH (Palo Alto, California, USA); 

- vector pBI 101 (JEFFERSON, 1987), sold by the company 
CLONTECH; 

- vector pBI121 (JEFFERSON, 1987), sold by the company 
30 CLONTECH; 

- vector pEGFP; Yang et al. (1996), sold by the company 
CLONTECH; 

- vector pCAMBIA 1302 (HAJDUKIIEWICZ et al., 1994); 
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- intermediate and superbinary vectors derived from the vectors 
pSB12 and pSB1 described by Japan Tobacco (EP 672 752 and Ishida 
et aL, 1996). 

Also part of the invention are the vectors pRDPS and pRDP4 
5 described in Exemple 2, and also the vectors pRDP2 and pRDP3 
described in Example 3, with all their characteristics defined in these 
examples. 

TRANSFORMED HOST CELLS ACCORDING TO THE INVENTION. 

10 

In order to allow the expression of a nucleic acid encoding an 
ISUM2A polypeptide according to the invention, placed under the control 
of a suitable regulatory sequence, the recombinant nucleic acids or 
vectors defined in the present description must be introduced into a host 
is cell. The introduction of the polynucleotides according to the invention 
into a host cell may be carried out in vitro, according to techniques well 
known to those skilled in the art. 

A subject of the invention is also a host cell transformed with a 
nucleic acid according to the invention or with a recombinant vector as 
20 defined above. 

Such a transformed host cell is preferably of bacterial, fungal or 
plant origin. 

Thus, bacterial cells of various strains of Escherichia coli or else 
of Agrobacterium tumefaciens may in particular be used. 

25 Advantageously, the transformed host cell is a plant cell or else 

a plant protoplast. 

Among the cells which can be transformed according to the 
method of the invention, mention may be made, by way of examples, of 
cells of large crop plants (maize, wheat, rapeseed, sunflower, pea, 

30 soybean, barley, etc.). Preferably, plants known to contain large (protein, 
carbohydrate and lipid) stores, in particular cereal plants or oil-yielding 
plants, may be chosen. 

The hybrid plants obtained by crossing plants according to the invention 
are also part of the invention. 
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Preferably, it is a cell or a protoplast of a cereal plant. The cell 
or the protoplast preferably comes from maize, wheat, barley, sorghum, 
millet, rye or rice. 

Entirely preferably, it is a cell from maize. 
5 A subject of the invention is also the use of a nucleic acid 

comprising a polynucleotide encoding an ISUM2A polypeptide, where 
appropriate in the form of a nucleic acid construct as defined above, for 
producing a transformed plant capable of producing grains with altered 
germ development. 

10 The invention also relates to the use of a recombinant vector as 

defined in the present description for producing a transformed plant 
capable of producing grains with altered germ development. 

The invention also relates to the use of a cellular host transformed 
with a nucleic acid comprising a polynucleotide encoding an ISUM2A 

is polypeptide, where appropriate in the form of a nucleic acid construct or 
expression cassette as defined above, for producing a transformed plant 
capable of producing seeds with altered germ development. 

The invention also relates to a transformed plant comprising a 
plurality of host cells as defined above. 

20 

PLANTS TRANSFORMED WITH A NUCLEIC ACID WHICH ALLOWS 
THE SYNTHESIS OF THE ISUM2A POLYPEPTIDE AND METHODS 
FOR OBTAINING THEM 

25 The invention also relates to a transformed multicellular plant 

organism, characterized in that it comprises a host cell, or a plurality of 
host cells, transformed with a nucleic acid comprising a polynucleotide 
encoding the ISUM2A polypeptide as defined in the present description, 
or else with a recombinant vector comprising such a nucleic acid. 

30 A subject of the invention is also a transformed plant 

comprising, in a form artificially integrated into its geinome, a nucleic acid 
which allows the synthesis of the ISUM2A polypeptide, as defined in the 
present description. 

The transformed plant may contain a plurality of copies of a 

35 nucleic acid encoding the ISUM2A polypeptide, in situations in which 
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overexpression of the ISUM2A polypeptide is sought. Overexpression of 
the ISUM2A polypeptide is sought in particular when it is desired to 
obtain plants producing grains in which the germ is significantly larger in 
size than in the "wild-type" plants, and which is enriched in oil. 
5 According to another aspect, the overexpression of the ISUM2A 

polypeptide can be obtained by transforming a host cell or a plant with a 
nucleic acid encoding the ISUM2A polypeptide and in which the 
polynucleotide comprising the open reading frame is placed under the 
control of a regulatory nucleic acid which allows a high level of 
10 transcription of the corresponding mRNA or a high level of translation of 
the ISUM2A polypeptide in the host cell or in the plant. 

The invention therefore also relates to a transformed plant as 
defined above, the grains of which are rich in oil. 

It also relates to a transformed plant as defined above, which 
15 has improved agronomic and/or nutritional qualities. 

It also relates to a method for obtaining such a transformed 

plant. 

Among the plants which can be transformed according to the 
invention, mention may be made, by way of nonlimiting illustrative 
20 example, of large crop plants, preferably maize, wheat, rapeseed, 
sunflower, pea, soybean and barley. 

Given the aim pursued by the invention, which consists mainly 
in obtaining starch-rich grains with altered germ development, the 
preferred plants according to the invention are those for which the grains 
25 have a high starch content or a large amount of starch, and entirely 
preferably maize, wheat, barley, sorghum, millet, rye or rice. 

The hybrid plants obtained by crossing transformed plants 
according to the invention are also part of the invention. 

The invention also relates to any part of a transformed plant as 
30 defined in the present invention, such as the root, but also the aerial 
parts, for instance the stem, the leaf, the flower and especially the grain. 

A subject of the invention is also a plant seed or grain produced 
by a transformed plant as defined above. Typically, such a transformed 
seed or such a transformed grain comprises one or more cells 
35 comprising, in their genome, one or more copies of a nucleic acid which 
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allows the synthesis of the ISUM2A polypeptide, where appropriate in a 
controlled and inducible manner. 

When overexpression of the ISUM2A polypeptide has been 
obtained in the plant, the plant seeds or grains are enriched in oil. A 
5 subject of the invention is therefore seeds and grains enriched in oil, 
prepared or obtained from a transformed plant overexpressing the 
ISUM2A polypeptide. 

The grains rich in oil are used for preparing seeds or seed 
meals enriched in oil, which can be used in agriculture and in the 
10 agrofoods industry. 

According to a preferred embodiment of a transformed plant 
according to the invention, controlled expression of the ISUM2A 
polypeptide is sought, which implies that the transformed plant contains, 
as functional copy of a polynucleotide encoding the ISUM2A polypeptide, 
15 only the copy or copies which has or have been artificially introduced into 
their cells, and preferably into their genome, while the sequences of the 
isum2A gene that are found naturally in the wild-type plant carry at least 
one mutation causing a deficiency in expression of the isum2A gene. 

Such plants mutated in the isum2A gene are, for example, the 
20 G2422 mutant plants or the emb*-8516 mutant plants described by 
HECKEL et al. (1999). Those skilled in the art can, by virtue of the 
invention, produce other plants mutated in the isum2A gene, for example 
by random insertion of the Mutator transposon into a population of plants 
of wild-type phenotype (lsum2A + /lsum2A + ), and then detection in the 
25 mutants obtained of those among these mutants which no longer 
express the isum2A gene, for example using the nucleotide probes or 
primers described in the examples. 

According to this preferred embodiment, the transformed plant 
according to the invention is characterized in that it derives from a plant 
30 in which the two copies of the isum2A gene each carry at least one 
mutation causing a deficiency in its expression, at the transcriptional or 
translational level. 

According to a second preferred embodiment according to the 
invention, controlled inhibition of the synthesis of the ISUM2A 
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polypeptide, for example through the expression of an antisense 
polynucleotide, is sought. 

In this case, the transformed plant comprises at least one 
functional copy of the isum2A gene, and preferably the two copies of the 
5 isum2A gene are functional. 

The invention also relates to a method for obtaining a 
transformed plant capable of producing grains altered in germ 
development, characterized in that it comprises the following steps: 

a) transforming at least one plant cell; 

10 - with a nucleic acid comprising a polynucleotide encoding an 

ISUM2A polypeptide, chosen from the sequences having at least 95% 
amino acid identity with the sequences SEQ ID No. 5 and SEQ ID No. 6, 
said nucleic acid also comprising a regulatory polynucleotide as defined 
in the present description; or 

15 - with a recombinant vector comprising such a nucleic acid; 

b) selecting the cells transformed in step a) which have 
integrated into their genome at least one copy of a nucleic acid encoding 
the ISUM2A polypeptide; 

c) regenerating a transformed plant from the transformed cells 
20 obtained in step b). 

According to the invention, the expression "grains altered in 
germ development" is intended to mean grains in which the development 
of the germ is "abnormal", i.e. differs significantly from the germ of a 
"wild-type" plant grain. 

25 According to a first aspect, the germ development may be 

altered such that the size of the germ is significantly greater than that of 
the germ found in the grains of "wild-type" plants not altered in isum2A 
gene expression. A significantly increased size of the germ can be 
obtained by overexpressing the ISUM2A polypeptide, for example either 

30 by introducing a plurality of copies of a nucleic acid encoding this 
polypeptide into a host cell or into a plant, or by placing a copy of the 
isum2A gene or of its cDNA under the control of a regulatory 
polynucleotide which allows overexpression of the ISUM2A polypeptide 
in the host cell or in the plant. 
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By overexpressing the ISUM2A polypeptide from the early stage 
of embryo development, large grains rich in oil are obtained. 

A subject of the invention is also a particular embodiment of the 
method above for obtaining a transformed plant capable of producing 
5 grains with altered germ development, in which at least one plant cell is 
transformed in step a) with Agrobacterium tumefaciens containing: 

- a nucleic acid comprising a polynucleotide encoding an 
ISUM2A polypeptide, chosen from the sequences having at least 95% 
amino acid identity with the sequences SEQ ID No. 5 and SEQ ID No. 6 

10 and comprising a regulatory polynucleotide as defined in the present 
description; or 

- a recombinant vector comprising such a nucleic acid. 

Each of the methods for obtaining a transformed plant 

according to the invention may comprise the following additional steps: 
15 d) crossing a plant selected in step c) with a heterozygous plant 

comprising a functional copy of the isum2A gene and an inactive copy of 

the isum2A gene; 

e) selecting the plants derived from the cross of step d) which 

are homozygous and carry two inactive copies of the isum2A gene. 
20 Preferably, the regulatory polynucleotide is sensitive to the 

action of an inducer signal; the regulatory polynucleotide is also referred 

to as inducible. 

According to a first preferred embodiment, the inducible 
regulatory polynucleotide consists of a "repressor" regulatory 
25 polynucleotide as defined in the present description. 

According to a second preferred embodiment, the inducible 
regulatory polynucleotide is a transcription- or translation-"activating" 
regulatory polynucleotide as defined in the present description. 

The present invention also relates to a transformed plant as 
30 obtained by one of the methods of production defined above. 

The invention also relates to a hybrid transgenic plant obtained 
by crossing a transformed plant as defined above. 

The invention also relates to a part of a transformed plant 
according to the invention. 
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A subject of the invention is also a method for obtaining plant 
grains with altered germ development, characterized in that it comprises 
the following steps: 

a) cultivating, until pollination, a plant in which the two copies of 
5 the isum2A gene carry at least one mutation causing a deficiency in the 

production of the ISUM2A polypeptide, and into the genome of which 
has been artificially introduced a nucleic acid comprising: 

- a polynucleotide encoding an ISUM2A polypeptide, chosen 
from the sequences having at least 95% amino acid identity with the 

10 sequences SEQ ID No. 5 and SEQ ID No. 6; and 

- an inducible regulatory polynucleotide of the repressor type 
controlling the expression of the polynucleotide encoding the ISUM2A 
polypeptide, 

the cultivating of the plant being carried out in the absence of the 
15 repressor inducer signal to which the repressor regulatory polynucleotide 
is sensitive; 

b) bring the transformed plant defined in a) into contact with the 
repressor inducer signal to which the repressor regulatory polynucleotide 
is sensitive, for a period of time ranging from pollination to the end of 

20 grain formation; 

c) recovering the mature grains, characterized in that they are 
altered in germ development. 

The germ development can be altered such that the size of the 
germ is significantly decreased compared to that of the grains originating 

25 from wild-type plants, or even inexistent, as has been shown when the 
plant has a nonfunctional isum2A gene or else when a nucleic acid 
encoding the ISUM2A polypeptide under the control of a regulatory 
polynucleotide which makes it possible to block the transcription or 
translation in a controlled manner in order to alter germ development is 

30 introduced. 

A subject of the present invention is also a method for obtaining 
plant grains with altered germ development, characterized in that it 
comprises the following steps: 
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a) cultivating, until pollination, a plant in which the two copies of 
the isum2A gene each carry at least one mutation causing a deficiency in 
the production of the ISUM2A polypeptide, and into the genome of which 
has been artificially introduced a nucleic acid comprising: 

5 - a polynucleotide encoding an ISUM2A polypeptide, chosen 

from the sequences having at least 95% amino acid identity with the 
sequences SEQ ID No. 5 and SEQ ID No. 6; and 

- a regulatory polynucleotide of the inducible activator type 
controlling the expression of the polynucleotide encoding the ISUM2A 

10 polypeptide, 

the cultivating of the plant being carried out in the presence of the 
activator inducer signal to which the activator regulatory polynucleotide is 
sensitive; 

b) continuing the cultivation of the transformed plant defined in 
15 a) in the absence of the activator induced signal to which the activator 

regulatory polynucleotide is sensitive, from the period following 
pollination; 

c) recovering the mature grains, characterized in that they are 
altered in germ development. 

20 The invention also relates to a seed with altered germ 

development as obtained according to one of the methods defined 
above. 

The invention also relates to a seed with altered germ 
development, characterized in that each of its constitutive cells 
25 comprises, in a form artificially integrated into their genome, a nucleic 
acid comprising: 

- a polynucleotide encoding an ISUM2A polypeptide, chosen 
from the sequences having at least 95% amino acid identity with the 
sequences SEQ ID No. 5 and SEQ ID No. 6; and 

30 - a regulatory polynucleotide. 

According to a first embodiment, the regulatory polynucleotide 
consists of an inducible regulatory polynucleotide of the repressor type. 

According to a second preferred embodiment, the regulatory 
polynucleotide consists of a regulatory polynucleotide of the inducible 
35 activator type. 
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Also part of the invention is any product of transformation of the 
seed as defined in the present description, in particular a seed or an oil. 

Preferably, the product of transformation is a starch. 

In order to obtain starch from a seed with altered germ 
5 development according to the invention, those skilled in the art will 
advantageously make use of the techniques described in the work 
"Handbuch der Starke" (vol. I; Max Ullmann (ed.), Paul Varey Verlag, 
Berlin) or else in the article by Morrison and Karkalas (Methods in Plant 
Biochemistry, 1990, vol.2: 323-352, Academic Press Ltd; London). 
10 The starch obtained from the seeds with altered germ 

development according to the invention may be used by the agrofoods 
industry, the pharmaceutical industry or the paper industry, or else in the 
microbiological field, where it can be used as a nutritive substrate. 

The transformation of plant cells can be carried out by the 
15 techniques known to those skilled in the art. 

Mention may in particular be made of the methods of direct 
gene transfer, such as direct microinjection into plant embryoids 
(NEUHAUS et al., 1987), infiltration under vacuum (BECHTOLD et al„ 
1993) or electroporation (CHUPEAU et al., 1989) or else direct 
20 precipitation by means of PEG (SCHOCHER et al., 1986) or 
bombardment, using a particle gun, of particles coated with the plasmid 
DNA of interest (FROMM M. et al. 1990). 

It is also possible to infect the plant with a bacterial strain, in 
particular of Agrobacterium. According to one embodiment of the method 
25 of the invention, the plant cells are transformed with a vector according 
to the invention, said cellular host being capable of infecting said plant 
cells by allowing integration into the genome thereof of the nucleotide 
sequences of interest initially contained in the DNA of the 
abovementioned vector. Advantageously, the abovementioned cellular 
30 host used is Agrobacterium tumefaciens, in particular according to the 
method described in the article by AN et al., (1986), or else 
Agrobacterium rhizogenes, in particular according to the method 
described in the article by GUERCHE et al., (1987) or else in PCT 
application No. WO OO 22.148. 
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For example, the transformation of the plant cells can be carried 
out by transferring the T region of the tumor-inducing extrachromosomal 
circular plasmid Ti from Agrobacterium tumefaciens, using a binary 
system (WATSON et al. 1994). To do this, two vectors are constructed. 
5 In one of these vectors, the T region has been removed by deletion, with 
the exception of the right and left borders, and between them the gene of 
interest and also a marker gene are inserted, so as to allow selection in 
the plant cells. The other partner of the binary system is an auxiliary 
plasmid Ti, which modified plasmid no longer has a T region but still 
10 contains the vir virulence genes required for transformation of the plant 
cell. 

According to a preferred embodiment, use may be made of the 
method described by ISHIDA et al. (1996) for the transformation of 
monocotyledons. 

15 According to another protocol, the transformation is carried out 

according to the method described by FINER et al. (1992) using a 
particle gun with tungsten or gold particles. 

Those skilled in the art are capable of implementing many 
methods of the state of the art in order to obtain plants transformed with 

20 a nucleic acid which allows synthesis of the ISUM2A polypeptide. 

Those skilled in the art may advantageously refer to the 
technique described by BECHTOLD et al. (1993) in order to transform a 
plant using the bacterium Agrobacterium tumefaciens. Techniques using 
other types of vectors may also be used, such as the techniques used by 

25 BOUCHEZ et al. (1993) or else by HORSCH et al. (1994). 

By way of illustration, a transgenic plant according to the 
invention may be obtained by biolistic techniques such as those 
described by FINER et al. (1992) or else those described by VAIN et al. 
(1993). 

30 Other preferred techniques for transforming a plant in 

accordance with the invention with Agrobacterium tumefaciens are those 
described by ISHIDA et al. (1996) or else in the PCT application 
published under the No. WO 95/06 722 in the name of JAPAN 
TOBACCO. 

35 
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POLYPEPTIDES ENCODED BY THE ISUM2A GENE 

As already described above, the isum2A gene encodes a 
polypeptide 143 amino acids in length, for which it has been possible to 
observe at least two variant polypeptides. 

The first variant polypeptide encoded by the isum2A gene has 
the amino acid sequence SEQ ID No. 5 and is encoded by the isum2A 
gene present in the genome of the maize plant denoted HD5x HD7. 

The second variant ISUM2A polypeptide is encoded by the 
isum2A gene present in the genome of the maize plant denoted A188. 

The ISUM2A polypeptides of sequences SEQ ID No. 5 and 
SEQ ID No. 6 differ only by virtue of the substitution of an amino acid 
residue, the polypeptide of sequence SEQ ID No. 5 having a glycine 
residue at position 89 and the polypeptide of sequence SEQ ID No. 6 
having an asparagine amino acid residue at this position. 

The expression of one or other of the ISUM2A polypeptide 
variants leads, in all cases, to the expression of a wild-type phenotype in 
the plant, said plant producing mature, fertile grains comprising a 
completely developed embryo. 

A homology search in the PROSITE database has made it 
possible to show structural similarities between the ISUM2A polypeptides 
according to the invention and the chloroplast L35 proteins. 

The ISUM2A polypeptide of amino acid sequence SEQ ID No. 5 
has a calculated molecular weight of 15112 daltons, a calculated 
isoelectric point of 1 1 .75 and a charge at pH 7 of 28.19. 

The ISUM2A polypeptide of sequence SEQ ID No. 5 has 40 
charged amino acid residues (R, K, H, Y, C, D, E), 3 acidic amino acid 
residues (D, E), 31 basic amino acid residues (K, R), 31 polar amino acid 
residues (N, C, Q, S, T, Y) and 55 hydrophobic amino acid residues (A, I, 
L, F, W,V). 

Given the presence of a single substitution of an amino acid 
residue, with respect to the sequence SEQ ID No. 5, the ISUM2A 
polypeptide of sequence SEQ ID No. 6 has characteristics very similar to 
those described above for the ISUM2A polypeptide of sequence SEQ ID 
No. 5. 
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A subject of the invention is therefore also the polypeptide 
comprising the amino acid seqence SEQ ID No. 5 or SEQ ID No. 6 and 
also a polypeptide having at least 95% amino acid identity with the 
sequence SEQ ID No. 5 or SEQ ID No. 6, or a fragment or a variant 
5 thereof. 

A fragment of an ISUM2A polypeptide according to the 
invention comprises at least 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 
100, 120, 130, 135 or 140 consecutive amino acids of a polypeptide of 
sequence SEQ ID No. 5 or SEQ ID No. 6. 
10 Also part of the invention is a polypeptide comprising 10, 15, 

20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 120, 130, 135 or 140 consecutive 
amino acids of a polypeptide of sequence SEQ ID No. 5 or SEQ ID No. 
6. 

The invention also relates to a polypeptide comprising an amino 
15 acid sequence having at least 95% amino acid identity with the sequence 
of an ISUM2A polypeptide of sequence SEQ ID No. 5 or SEQ ID No. 6. 

Advantageously, also part of the invention is a polypeptide 
having at least 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 
99.5%, 99.6%, 99.7%, 99.8% or 99.9% amino acid identity with the 
20 sequence of a polypeptide of sequence SEQ ID No. 5 or SEQ ID No. 6, 
or a peptide fragment thereof. 

In general, the polypeptides according to the present invention 
are in an isolated or purified form. 

Another subject of the invention consists of a polypeptide 
25 comprising amino acid modifications of 1, 2, 3, 4 or 5 substitutions, 
additions or deletions of an amino acid compared to the amino acid 
sequence of a polypeptide of sequence SEQ ID No. 5 or SEQ ID No. 6 
or else of a fragment or of a variant thereof. 

A polypeptide according to the invention may be obtained by 
30 genetic recombination according to techniques well known to those 
skilled in the art, for example techniques described in AUSUBEL et al. 
(1989). 

A polypeptide according to the invention may also be prepared 
by conventional techniques of chemical synthesis, equally in 
35 homogeneous solution or in solid phase. 
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By way of illustration, a polypeptide according to the invention 
may be prepared by the homogeneous solution technique described by 
HOUBEN WEIL (1974) or else by the solid-phase synthesis technique 
described by MERRIFIELD (1965a; 1965b). 
5 Preferably, the polypeptides which are variants of a polypeptide 

according to the invention conserve their ability to be recognized by 
antibodies directed against the polypeptides of sequence SEQ ID No. 5 
or 6. 

A polypeptide encoded by the lsum2A gene according to the 

10 invention, such as a polypeptide of amino acid sequence SEQ ID No. 5 
or 6, or else a variant or a peptide fragment thereof, is useful in particular 
for preparing antibodies intended for the detecting the presence and/or 
the expression of a polypeptide of sequence SEQ ID No. 5 or 6 or of a 
peptide fragment thereof, in a sample. 

15 Besides the detection of the presence of a polypeptide encoded 

by the lsum2A gene or else of a peptide fragment of such a polypeptide, 
in a sample, antibodies directed against these polypeptides are used for 
quantifying the synthesis of a polypeptide of sequence SEQ ID No. 5 or 
6, for example in cells of a plant, and thus determine the development of 

20 the embryo. 

For the purpose of the present invention, the term "antibodies" 
will be intended to mean in particular polyclonal or monoclonal antibodies 
or fragments (for example F(ab)' 2 , F(ab) fragments) or else any 
polypeptide comprising a domain of the initial antibody, which recognizes 

25 the target polypeptide or the target polypeptide fragment according to the 
invention. 

Monoclonal antibodies may be prepared from hybridomas 
according to the technique described by KOHLER and MILSTEIN (1975). 

The present invention also relates to antibodies directed against 
30 a polypeptide as described above or a fragment or a variant thereof, as 
produced in the trioma technique or else the hybridoma technique 
described by KOZBOR et al. (1983). 

The invention also relates to single-chain Fv (ScFv) antibody 
fragments as described in US No. 4,946,778 or else by MARTINEAU et 
35 al. (1998). 
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The antibodies according to the invention also comprise 
antibody fragments obtained by means of phage libraries as described 
by RIDDER et al. (1995) or else humanized antibodies as described by 
REINMANN et al. (1997) and LEGER et al. (1997). The preparations of 
5 antibodies according to the invention are useful in immunodetection 
assays intended to identify the presence and/or or the amount of a 
polypeptide of sequence SEQ ID No. 5 or 6, or of a peptide fragment 
thereof, present in a sample. 

An antibody according to the invention may also comprise an 
10 isotopic or nonisotopic detectable label, for example a fluorescent label, 
or else may be coupled to a molecule such as biotin, according to 
techniques well known to those skilled in the art. 

Thus, a subject of the invention is also a method for detecting 
the presence of a polypeptide in accordance with the invention in a 
15 sample, said method comprising the steps of: 

a) bringing the test sample into contact with an antibody as 
described above; 

b) detecting the antigen/antibody complex formed. 

The invention also relates to a diagnostic pack or kit for 
20 detecting the presence of a polypeptide in accordance with the invention 
in a sample, said pack comprising: 

a) an antibody as defined above; 

b) where appropriate, one or more reagents required for the 
detection of the antigen/antibody complex formed. 

25 Another subject of the invention consists of the use of a nucleic 

acid or of an allelic variant of a nucleic acid as defined above, in 
selection programs for obtaining plants with modified embryo size and/or 
development influencing the starch and/or oil content. 

The invention also relates to a method for selecting plants with 
30 modified embryo size and/or development, comprising the steps of: 

a) genotyping plants (individuals) using nucleotide probes or 
primers obtained from the nucleic acids as defined above or variants of 
these nucleic acids; 
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b) selecting, from these plants (individuals), those which 
comprise a high frequency of favorable alleles associated with the size 
and/or development of the embryo. 

The present invention is also illustrated, without however being 
5 limited, by the following figures and examples: 

FIGURES 

Figure 1 illustrates the genomic nucleotide sequence of the 
isum2A gene present in the maize plant denoted HD5 x HD7 referenced 
as the sequence SEQ ID No. 1 of the sequence listing. The three exons 
10 of the isum2A gene are represented in the form of boxes, under the 
corresponding nucleotide sequence. The various recognition sites for 
restriction endonucleases are indicated, above the corresponding 
nucleotide sequence. 

Figure 2 illustrates a nucleotide sequence of the cDNA of the 
15 isum2A gene in the maize plant denoted HD5 x HD7, this sequence 
being referenced as the sequence SEQ ID No. 3 of the sequence listing. 

The sequences derived from the three exons of the gene are 
represented by boxes located under the corresponding nucleotide 
sequence. 

20 The deduced amino acid sequence is represented below the 

boxes representing the exons. 

The box denoted "RL35 domain" corresponds to the portion of 

the sequence of the ISUM2A polypeptide which is homologous with 

certain chloroplast proteins. The ISUM2A polypeptide represented in 
25 Figure 2 is referenced as the sequence SEQ ID No. 5 of the sequence 

listing. 

Figure 3 illustrates a partial cDNA sequence corresponding to 
the messenger RNA found in the maize plant denoted A188, this 
nucleotide sequence being referenced as the sequence SEQ ID No. 4 of 
30 the sequence listing. 

The deduced sequence of the protein encoded by the cDNA is 
represented below the corresponding nucleotide sequence, and is 
referenced as the sequence SEQ ID No. 6 of the sequence listing. 

The boxes located below the peptide sequence represent the 
35 three exons derived from the isum2A gene, found in the maize line A188. 
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The box denoted "RL35 domain" corresponds to the portion of 
the ISUM2A polypeptide having homology with chloroplast proteins. 

Figure 4 illustrates the exon and intron structure of the isum2A 
gene, the boxes representing the exons of the gene. The base A of the 
5 ATG codon is the nucleotide at position 51 of the first exon and the base 
T of the TGA translation stop codon is the nucleotide at position 76 of 
the third exon. The boxes located below the structure of the isum2A 
gene represent the various genomic DNA clones used in the examples. 
The boxes located above the structure of the isum2A gene represent the 
10 various cDNA clones used in the examples. 

Figure 5 illustrates a map of the plasmid pRDP5. 

Figure 6 illustrates the map of the plasmid pTRE sold by the 
company CLONTECH LABORATORIES Inc., the structure of which is 
accessible in the GENBANK database under the accession number 
15 U89931. 

Figure 7 illustrates the map of the plasmid pDM302 described 
by CAO etal. (1992). 

Figure 8 illustrates the map of the plasmid pTet-Off sold by the 
company CLONTECH (reference in catalog for year 2000: K1620-A). 
20 Figure 9 illustrates the map of the plasmid pRDP4. 

Figure 10 illustrates the map of the plasmid pTA7001 described 
by AOYAMA etal. (1997). 

Figure 11 illustrates the map of the plasmid pRDP2. 

Figure 12 illustrates the map of the plasmid pRDP3. 
25 Figure 13 illustrates the principle for the functioning of the 

expression system II summarized in Table 3 and which is the subject of 
Example 2. 

Figure 13A illustrates the constitutve expression of the ISUM2A 
polypeptide under the effect of activation of the promoter containing the 
30 TRE sequence by the tTA activator, which is itself expressed 
constitutively. 

Figure 13B illustrates the repression of the synthesis of the 
ISUM2A polypeptide when the tTA activator is brought into contact with 
tetracycline, which deactivates it and prevents it from activating the 
35 promoter containing the TRE sequence. 
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Figure 14 illustrates a scheme for the functioning of an 
expression system I summarized in Table 3 and which is the subject of 
Example 3. 

Figure 14A illustrates the synthesis of the ISUM2A polypeptide 
5 when the promoter containing the UAS sequence is activated by the 
GVG fusion protein, in the presence of a glucocorticoid. 

Figure 14B illustrates the absence of production of ISUM2A 
polypeptide in a situation in which the GVG fusion protein is produced in 
the absence of glucocorticoid and does not activate the UAS sequence 
10 of the promoter controlling the expression of the isum2A gene. 

EXAMPLES: 

EXAMPLE 1: Cloning the genomic sequence of the isum2A gene 
15 A. MATERIALS AND METHODS 

A1. Plant material 

Plants of the A188 line (Gerdes and Tracy, 1993) were used for 
the "genomic walk" and the isolation of RNA. As regards the DNA 
20 fragments contained in the genomic library which was screened, they 
originate from the hybrid line HD5xHD7 (Barloy et al., 1989). The mutant 
emb*-8516 and the recurrent parent Rscm2 were described in Heckel et 
al. (1999). 

25 A2. Cultivating of plants 

The plants were cultivated either in a climatic chamber, with a light 
period of 16 h (100 Wm-2), a relative humidity of 80% and a 24°C/19°C 
(day/night) alternation, or in a greenhouse under the same conditions but 
without humidity control, or in an open field in Lyons. All the plants were 

30 pollinated by hand. 

A3. Isolation of plant genomic DNA 

A young maize leaf 10 cm long was removed, placed in a 1.5 ml 
Eppendorf tube and ground with a suitable pestle, in liquid nitrogen. 
35 500 pi of extraction buffer (100 mM Tris-HCI, pH 8, 50 mM EDTA pH 8, 
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100 mM NaCI, 1% SDS) were added and the tube was incubated for 
5 min at 55°C. After two extractions with phenol/chloroform, the 
supernatant was precipitated using 50 pi of sodium acetate and 350 pi of 
isopropanol (cold) for 10 min at ambient temperature. The DNA pellet 
5 was rinsed with 1 ml of 70% ethanol, dried and taken up in 50 pi of 
TE10.01 (10 mM Tris-HCI, pH 8, 0.1 mM EDTA pH 8) supplemented with 
20 \jg/m\ of RNAse A. 

A4. PCR on genomic DNA 

io The genomic DNA was diluted 10-fold and 2 pi were used in a 

20 pi reaction containing 1 pM of each primer, 100 pM of dNTP and 1 u 
of Taq polymerase in the buffer provided with the enzyme (Pharmacia). 
The reaction was carried out in a Perkin Elmer DNA thermal Cycler 2400 
or 9700 PCR amplifier in the following way: after denaturation of the DNA 

is for 2 min at 94°C, the samples were subjected to 30 cycles of 
denaturation for 30 s at 94°C / hybridization for 45 s at 60°C / elongation 
for 1 min at 72°C. All the primers had Tm values between 60°C and 
62°C. 

20 AS. Cloning 

The plasmid DNA was prepared according to the alkaline lysis 
protocol described by Sambrook et al. (1989). The digestions of genomic 
DNA, of plasmid DNA or of lambda phage DNA with restriction enzymes 
were carried out according to the manufacturers' (Gibco, Boehringer 

25 Mannhein, Promega) recommendations. The products from digestion or 
from PCR amplification were separated by agarose gel electrophoresis in 
TAE buffer (Sambrook et al., 1989). The restriction fragments were 
ligated into the vector pBluescript and the PCR products were ligated 
into pGEM-T-easy, by means of T4-DNA ligase according to the 

30 supplier's (Promega) recommendations, using 50 ng of vector, and an 
insert/vector molar ratio of the order of 3/1. These constructs were 
introduced into the bacterial strain DH5-a by heat-shock transformation, 
according to the protocol of Hanahan (1983). 

35 A.6. DNA hybridization 
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Membranes for DNA hybridization were obtained either by 
transfer of agarose gel restriction fragments onto Hybond N+ 
membranes (Amersham) by capillarity, in the presence of 0.4 N sodium 
hydroxide, or by transfer of lambda phage lysis plaques onto Hybond N 
5 membranes (Amersham). The membranes were prehybridized according 
to the supplier's recommendations, for 4 to 12 h at 65°C in the presence 
of 5 x SSPE (20 x SSPE contains 3.6 M NaCI, 0.2 M sodium phosphate, 
EDTA, pH 7.7; 0.02 M). They were then hybridized for 16 h under the 
same conditions. The probes used for the hybridization were 

10 radioactively labeled with 50 pCi of a 32 P-dCTP, using the Ready-To- 
GO™ DNA labelling beads kit (Amersham). After hybridization, the 
membranes were subjected to four rinses, at 65°C in solutions 
containing 0.1% of SDS and SSC concentrations of 2 x, 1 x, 0.5 x and 
0.1 x (20 x SSC contains M NaCI 3 , 0.3 M Na 3 -citrate), and were exposed 

15 to films for autoradiography (X-OMAT, Kodak). 

A. 7. Genomic DNA library screening 

A genomic DNA library contained in the EMBL3 SP6/T7 lambda 
phage (Stratagene) was plated out at a density of 1.5x10 5 lysis plaques 

20 per dish, on 10 dishes, and then transferred onto Hybond N membrane 
according to the supplier's (Amersham) recommendations. After 
hybridization and autoradiography, the lysis plaques giving a signal were 
removed from the dishes and plated out at a lower density, and thus 
twice in a row in order to obtain a single clone per dish, in the form of 

25 isolated plates. The DNAs were then prepared according to Sambrook et 
al. (1989). 

A. 8. AIMS ("amplification of insertion mutagenized sites") 

The amplification of insertion mutagenized sites was obtained 
30 according to Frey et al. (1998). The Tru9A enzyme (Promega) was used 
for the digestion in place of Msel. The sequences of the "Mu-specific" 
and "Mu-nested" primers were degenerate at additional positions. The 
primers used, Mu15 (SEQ ID No. 14) and Mu16 (SEQ ID No. 8), are 
listed in Table 4. 



62 



A.9. "Genomic walk" 

The protocol of Devic et al. (1997) was applied without 
modification using the Dral, EcoRV, Pvull, Seal and Sspl enzymes. The 
PCR products were cloned into the vector pGEM®-T-Easy (Promega). 

5 

A. 10. RT-PCR 

Total RNA was extracted from 50 mg of ground tissue in liquid 
nitrogen with the TRIZOL™ reagent according to the supplier's (Gibco) 
protocol. The RNA, resuspended in 50 pi of Versol water (Aguettant), 

10 was denatured at 65°C for 5 min and treated with DNase I according to 
the supplier's (Promega) indications. It was then purified by two volume 
for volume extractions with phenol/chloroform, pH 4.5, and precipitation 
with ethanol. 6 pg of purified RNA were then "reverse" transcribed with 
MuLV reverse transcriptase (Gibco) in a total volume of 20 pi, as 

15 recommended by the supplier, with a polyT primer at a final 
concentration of 2.5 pM. After inactivation of the reverse transcriptase by 
heating at 95°C for 5 min, 2 pi of the reverse transcription reaction were 
amplified with Taq polymerase as described above. 

20 A.11. Sequencing 

The sequencing was carried out by the company Genome 
Express in Grenoble (France). The Sequencher 3.1.1 (Gene Codes 
Corporation) and DNAstar (Laser Gene) software made it possible to 
analyze and assemble the sequences obtained. 

25 

B. RESULTS 

B.1. Cloning of the sequences flanking the insertion of mutator in 
emb*-8516 

a) Cloning of a flanking sequence 

30 

As described in Heckel et al. (1999), a 107 bp AIMS product 
was obtained with the primers Mu16 (SEQ ID No. 8) and adapMsel (SEQ 
ID No. 9). This "AIMS product" (see Figure 4) contains 70 pb of the 
flanking sequence between the two primers, which does not show any 
35 significant homology with the sequences referenced in the databases. 
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b) Extension of the flanking sequence 

In order to extend this flanking sequence, the "genomic walk" 
5 (Devic et al., 1997) technique was used on the genomic DNA of the 
A188 genotype. With the nested primers GW4 (SEQ ID No. 12) and 
GW4b (SEQ ID No. 13) and cleavage with Sspl, a 244 bp product was 
obtained (clone L223a, Fig. 4), which contained an additional 153 bp of 
the flanking sequence. 
10 To characterize the sequence on the other side of the insertion, 

the "genomic walk" technique was used with the "nested" primers GW3 
(SEQ ID No. 10) and GW3b (SEQ ID No. 11) with Sspl cleavage. The 
1532 bp product (clone L223c, Fig. 4) contained 1463 bp of additioinal 
flanking sequence. 

15 

c) Homology of the flanking sequence in the databases 

Contrary to the "AIMS product" sequence and the sequence of 
the L223a clone, the sequence of the L223c clone showed homology in 

20 the databases with a maize EST. Two sequences of the same cDNA are 
found in Genebank: a sequence from the 5' end in accession AI001298, 
and a sequence from the 3' end in accession AI374506. This EST bears 
the name Isum2, without any explanation of this acronym. The homology 
was strong but restricted to a small portion of L223c. 

25 A PCR reaction was carried out on genomic DNA with the 

primers Isum2b (SEQ ID No. 18) and Isum2e (SEQ ID No. 20), which are 
located on either side of the presumed intron. The amplification product 
was cloned and sequenced. The sequence of this clone L157a (Fig.4) 
showed that the sequence present in the AIMS product, in L223a and in 

30 a portion of L223c corresponded to an intron of Isum2. 

B.2. Isolation of a second allele 

a) Screening of a collection of insertion mutants by reverse 
genetics ("gene machine") 

35 
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A population of 25 000 plants highly mutagenized with the 
mutator transposon was screened by PCR for an insertion in Isum2. The 
plants were planted in 10 blocks of 50 x 50 plants. Pools of DNAs from 
the 500 rows and 500 columns were analyzed by PCR for the presence 
5 of an amplification with the primer OMuA (SEQ ID No. 24 in mutator) in 
combination with a primer in an exon of Isum2. The four primers Isum2b 
(SEQ ID No. 18), Isum2c (SEQ ID No. 19), Isum2k (SEQ ID No. 21) and 
851 6g (SEQ ID No. 23) specific for Isum2 were tested. 

Positive results were obtained with the primers Isum2b and 

10 Isum2k. The plants in question were identified and the amplification was 
confirmed on individual plant DNA. Their descendants were then 
analyzed in order to distinguish somatic insertions (absent in the 
gametes) from germinative insertions (present in the gametes). Of all the 
insertions, only one was found in the subsequent generation. 

15 Sequencing of the PCR amplification products showed that it was an 
insertion in intron 1 of lsum2A, 3 bp upstream of exon 2, found in a plant 
denoted G2422. 

The grains of this plant G2422 were analyzed for the presence 
of the "germ-free grains" phenotype. The presence of many "parasitic" 

20 mutations in this highly mutagenized material made it difficult to evaluate 
some of the grains. Grains with a "germ-free grain" phenotype were 
detected. 

b) Lack of complementation between the two mutated alleles of 
25 isum2A. 

To prove that the "germ-free grain" phenotype of the emb*-8516 
mutant described by Heckel et al. (1999) was also caused by the 
insertion of mutator into the lsum2A gene, a complementation between 
the emb*-8516 mutant and descendants of the plant G2422 with an 

30 insertion of mutator into the same lsum2A gene was undertaken. 

Sixteen heterozygous +/emb*-8516 plants were pollinated by 6 
heterozygous +lemb*-2422 plants. In no case was the emb*-8516 
mutation complemented. This shows that the G2422 plants carry a 
mutation in the same gene which causes the emb phenotype of the 

35 emb*-8516 mutant. Since both have an insertion of mutator into the 
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lsum2A gene, it is established that this mutation of the isum2A gene 
causes the "germ-free grain" phenotype. 
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B.3. The lsum2A gene 

a) Genomic sequences oflsum2A 

For the lsum2A gene, a first genomic sequence was obtained 
5 by screening a genomic library of plants of the HD5 x HD7 genotype, and 
a second genomic sequence was obtained by PCR on plants of the 
A188 genotype. The clones used for the sequencing are shown in Fig. 4. 

The sequence of the HD5 x HD7 genotype (Fig. 1) comes from 
the two subclones L211c1 (9 kb Xhol fragment) and L211c2 (6 kb Xhol 
10 fragment). 

The sequence of the A188 genotype (Fig. 2) originates from the 
clones L157a (PCR with the primers Isum2b (SEQ ID No. 18) and 
Isum2e (SEQ ID No. 20)), L223a (genomic walk, see above) and L223c 
(genomic walk, see above). 

15 

b) cDNA sequences 

The partial cDNA sequence of the A188 genotype (SEQ ID 
No. 4, Fig. 3) originates from the L158a and L254 clones (Fig.4). The first 
was obtained by RT-PCR on 12 JAP embryos with the primers Isum2b 

20 (SEQ ID No. 18) and Isum 2c (SEQ ID No. 19), and the second by RT- 
PCR on 7 JAP albumens with the primers Isum2a (SEQ ID No. 17) and 
Isum2l (SEQ ID No. 22). 

The cDNA sequence of the genotype HD5 x HD7 (SEQ ID 
No. 3, Fig.2) was obtained from the genomic sequence (SEQ ID No. 1) 

25 using the Isum2 EST (above) to determine the 5' limit of the first exon 
and the 3' limit of the last exon. 

Table 4 below gives details of the various primers used in this 

example. 

30 
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Table 4: 
Primers used 



Name 


5' to 3' sequence 


SEQ ID 
No. 


Use 


Mu16 


TCYATAATGGCAATTATCTC 


8 


AIMS 
product 


adapMsel 


GATGAGTCCTGAGTAAN 


9 


AIMS 
product 


GW3 


GAAACTGGAAAGGCGAAATGGAGGGACG 


10 


L223c 


GW3b 


GCTCGATAGGTTTATTGTGATAACGTTGCT 
GG 


11 


L223c 


GW4 


G I CCC I CCA I I I CGCC I I I CCAG I I ICC 


12 


L223a, 


GW4b 


CCAGCAACGTTATCACAATAAACCTATCGA 
GC 


13 


L223a 


Mu15 


GAGAAGCCAACGCCAWCGCCTCYATTTC 
GTC 


14 


AIMS 
product 


AP1 


GGATCCTAATACGACTCACTATAGGGC 


15 


all "genomic 
walk" 


AP2 


CTATAGGGCTCGAGCGGC 


16 


all "genomic 
walk" 


Isum2a 


CTACCCGCAGCCAGCCTCGCATTCC 


17 


L254 


Isum2b 


GGCGGGGAAGAAGGGCTACAAGATGAAG 
AC 


18 


GM, L158a1, 
L157a 


Isum2c 


GCCAAGAAGAACACCAAGCGCAAGAAGAG 


19 


GM a 


Isum2e 


CGATCTGCTGGCCATATCCTAAGAG 


20 


L158a1, 
L157a 


Isum2k 


CATCTTCGAGAGCCTCTTCTTGCG 


21 


GM 


Isum2l 


CTTCATCTTGTAGCCCTTCTTCCCCG 


22 


L254 


851 6g 


GCACCCGTAACATTGTCGTAGTC 


23 


GM 


OMuA 


CTTCGTCCATAATGGCAATTATCTC 


24 


GM 



5 

-GM means "gene machine". 
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EXAMPLE 2: 

Construction of a vector comprising a polynucleotide encoding an 
ISUM2A polypeptide placed under the control of an inducible 
regulatory polynucleotide of the repressor type 

5 

The expression system described in this example constitutes an 
illustration of the expression system II summarized in Table 3, for which 
details of the general principle of functioning are given in the description. 

This expression system involves the construction of two vectors, 
10 respectively: 

(i) the vector pRDP5, whcih contains the isum2A gene placed 
under the control of the TRE sequence recognized by the tTA activator; 
and 

(ii) the vector pRDP4, which contains the gene of the tTA 
15 activator, which is active in the absence of tetracycline, placed under the 

control of the rice actinl promoter, which is a strong and constitutive 
promoter in monocotyledons. 

1. Construction of the vector pRDPS 

20 

The vector pTRE, sold by the company CLONTECH 
Laboratories Inc., and which is also described in the GENBANK 
database under the accession number U89931, is used as starting 
vector (Fig. 6). The vector pTRE is cleaved with the BamHI restriction 
25 endonuclease, at nucleotides 471 and 483 of the vector, the DNA 
fragment located between the abovementioned BamHI sites then being 
excised. 

After cleavage and excision, the sticky ends are filled in using 
Klenow polymerase. 
30 The isum2A gene described in Example 1 , more specifically the 

genomic region ranging from the ATG codon to the end of the reading 
frame and also comprising a terminator sequence, is then amplified by 
PCR. 
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The isum2A gene amplification product is then ligated into the 
open pTRE vector in order to construct the vector pRDP5 illustrated in 
Figure 5. 

In the vector pRDP5, the isum2A gene is placed under the 
5 control of the prTRE promoter, which is recognized by the tTA activator. 

2) Construction of the vector pRDP4 

The vector pRDP4 contains the gene of the tTA activator under 
10 the control of the rice actinl promoter. 

The starting vector is the plasmid pDM302, which was 
described by CAO et al. (1992) and which is illustrated in Figure 7. 

The plasmid pDM302 is digested with the Smal restriction 
endonuclease. 

15 The vector pTet-Off sold by the company CLONTECH 

Laboratories Inc. (catalog reference No. K 1620-A, from the year 2000), 
which is illustrated in Figure 8, is then used to amplify the sequence of 
the tTA gene by PCR. 

The tTA gene amplification product is then ligated into the open 
20 Smal site of the plasmid pDM302 in order to construct the plasmid 
pRDP4 illustrated in Figure 9. 

The plasmid pRDP4 comprises the tTA gene under the control 
of the rice actinl promoter. 

A diagram for the functioning of the inducible expression 
25 system described in the present example is represented in Figure 13, 
even in the absence of tetracycline (Figure 13A), or in the presence of 
tetracycline (Fig.13B). 

A summary of the characteristics of the various vectors used or 
prepared according to Example 2 is given in Tables 5 to 8 below. 

30 
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TABLE 5 

Positions of the characteristic elements of the 
plasmid pDM302 (unit: kilobase) 



vector pSP72 


0.00 


0.04 


actinl promoter 


0.04 


1.43 


polylinker 


1.43 


1.46 


Smal 


1.46 


1.47 


Basta resistance 


1.47 


2.03 


Smal 


2.03 


2.04 


polylinker 


2.03 


2.08 


nos terminator 


2.08 


2.34 


vector pSP72 


2.34 


4.74 



TABLE6 

Positions of the characteristic elements of the 
plasmid pRDP4 (unit: kilobase) 



vector pSP72 


0.0 


0.04 


actinl promoter 


0.04 


1.43 


polylinker 


1.43 


1.46 


tTA 


1.47 


2.47 


polylinker 


2.47 


2.52 


nos terminator 


2.52 


2.78 


vector pSP72 


2.78 


5.18 
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TABLE 7 

Positions of the characteristic elements of the 
plasmid pTRE2 (unit: kilobase) 



promoter with TRE 


0.00 


0.44 


polylinker 


0.44 


0.49 


SV40 terminator 


0.49 


0.95 


Vector pUC 


0.95 


3.10 



TABLE 8 

Positions of the characteristic elements of the 



10 



promoter with TRE 


0.00 


0.44 


polylinker 


0.44 


0.49 


lsum2A (ATG to terminator) 


0.49 


2.73 


SV40 terminator 


2.73 


3.19 


vector pUC 


3.19 


5.34 



EXAMPLE 3. 

Construction of a vector comprising a polynucleotide encoding an 
is ISUM2A polypeptide placed under the control of a regulatory 
polynucleotide of the inducible activator type 

The expression system according to Example 3 constitutes an 
illustration of the expression system I summarized in Table 3, which is 
20 commented upon in the description. 

The expression system according to Example 3 comprises two 
expression cassettes, respectively: 

(i) an expression cassette comprising the isum2A gene placed 
under the control of a promoter containing the UAS sequence recognized 
25 by the GVG activator; and 
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(ii) an expression cassette encoding the GVG activator, placed 
under the control of the rice actinl promoter, which is a strong and 
constitutive promoter in monocotyledons. 

5 1. Construction of the vector pRDP2 

The starting vector is the plasmid pTA7001 described by 
AOYAMA et al. (1997) and which is represented in Figure 10. 

The vector pTA7001 is subjected to digestion with the Sse8387l 
10 and Pmel enzymes in order to excise the cauliflower mosaic virus 35S 
promoter. 

After digestion, the sticky ends are filled in using Klenow 
polymerase. 

The rice actinl promoter is then amplified by PCR, from the 
15 plasmid pDM302 illustrated in Figure 7. 

The rice actinl promoter amplification product is then ligated 
into the predigested vector pTA7001, so as to construct the plasmid 
pRDP2 illustrated in Figure 11. 

20 2. Construction of the vector pRDP3 

The starting vector is the vector pRDP2 constructed according 
to the protocol above. 

The vector pRDP2 is first subjected to digestion with the Xhol 
25 and Spel restriction endonucleases. 

After digestion, the sticky ends are filled in with Klenow 
polymerase. 

The coding region of the isum2A gene is then amplified by 

PCR. 

30 Finally, the isum2A gene amplification product is ligated into the 

open vector pRDP2 in order to construct the vector pRDP3 illustrated in 
Figure 12. 

The vector pRDP3 contains two expression cassettes, 
respectively: 
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(i) an expression cassette containing the isum2A gene placed 
under the control of the UAS sequence recognized by the GVG activator; 
and 

(ii) an expression cassette comprising the sequence encoding 
5 the GVG activator placed under the control of the constitutive rice actinl 

promoter. 

The GVG gene encodes a protein from fusion between the 
DNA-binding domain of the GAL4 protein, the VP16 gene activator 
domain and the rat glucocorticoid receptor (GR). 

10 A scheme of the functioning of the inducible expression system 

described in the present example is represented in Figure 14, even in the 
presence of glucocorticoid (Figure 14A), or in the absence of 
glucocorticoid (Figure 14B). 

When the expression system according to Example 3 is used in 

15 the presence of the activator inducer signal consisting of a glucocorticoid 
hormone, the GVG hybrid transcription factor is transported into the 
nucleus and strongly activates all the promoters containing the UAS 
sequence (Figure 14B). 

Tables 9 to 11 below give a summary of the main 

20 characteristics of the various vectors used or prepared according to 
Example 3. 
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TABLE 9 

Positions of the characteristic elements of the 
plasmid pTA7001 

5 



right border 


0.00 


0.03 


Sse83871 


0.05 


0.05 


35S promoter 


0.05 


0.86 


Pmel 


0.87 


0.87 


GVG 


0.87 


2.18 


E9 terminator 


2.21 


2.76 


Nos promoter 


2.78 


3.11 


hygromycin resistance 


3.12 


4.15 


Nos terminator 


4.15 


4.40 


3A terminator 


4.89 


4.42 


Spel 


4.89 


4.89 


Xhol 


4.94 


4.94 


minimum TATA 


5.00 


4.94 


6 x GAL4 UAS 


5.20 


5.00 


left border 


5.84 


5.86 


pBI101 


5.20 


0.04 
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TABLE 10 

Positions of the characteristic elements of the 
plasmid pRDP2 



right border 


0.00 


0.03 


actinl promoter 


0.05 


1.28 


GVG 


1.28 


2.60 


E9 terminator 


2.62 


3.18 


Nos promoter 


3.20 


3.53 


hygromycin resistance 


3.54 


4.56 


Nos terminator 


4.56 


4.81 


3A terminator 


5.31 


4.84 


Spel 


5.31 


5.31 


Xhol 


5.36 


5.36 


minumum TATA 


5.41 


5.36 


6 x GAL4 UAS 


5.61 


5.41 


left border 


6.25 


6.28 


PBI101 


5.61 


0.04 
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Table 11 

Positions of the characteristic elements of the plasmid pRDP3 

5 



right border 


0.00 


0.03 


actinl promoter 


0.05 


1.28 


GVG 


1.28 


2.60 


E9 terminator 


2.62 


3.18 


Nos promoter 


3.20 


3.53 


hygromycin resistance 


3.54 


4.56 


Nos terminator 


4.56 


4.81 


3A terminator 


5.31 


4.84 


lsum2A 


7.18 


5.31 


minimum TATA 


7.23 


7.18 


6 x GAL4 UAS 


7.44 


7.23 


left border 


8.08 


8.10 


pBI101 


7.44 


0.04 



EXAMPLE 4: 

Obtaining plants transformed with a nucleic acid comprising a 
10 polynucleotide which allows the production of an ISUM2A 
polypeptide 

The transformation of a plant with the aim of obtaining a stable 
expression of the polynucleotide of interest in the transformed plant is 
necessary in order to ensure long-lasting modification of the maize grain, 
is Experiments were carried out with a construct of vectors described in 
Examples 2 and 3. 

4-1 PARTICLE GUN 



20 The method used is based on the use of a particle gun identical 

to that described by FINER (1992). 
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The target cells are rapidly dividing undifferentiated cells which 
have conserved an ability to regenerate whole plants. This type of cell 
makes up the maize embryogenic callus (referred to as type II callus). 
These calluses are obtained from immature embryos of the Hill genotype 
5 according to the method and on the media described by ARMSTRONG 
(1994) MAIZE HANDBOOK; 1994, M. FREELING, V. WALBOT EDS.; 
PP 665-671 . Fragments of such calluses, having a surface area of from 
10 to 20 mm 2 , were placed, 4 h before bombardment, in a proportion of 
16 fragments per dish, at the center of a Petri dish containing a culture 

10 medium identical to the initiating medium, supplemented with 0.2 M of 
mannitol + 0.2 M of sorbitol. The plasmids described in the examples 
above and carrying the genes to be introduced are purified on a Qiagen R 
column according to the manufacturer's instructions. They are then 
precipitated onto particles of tungsten (M10) according to the protocol 

15 described by KLEIN (1987). The particles thus coated are projected onto 
the target cells using the gun and according to the protocol described by 
J. FINER (1992). The dishes of calluses thus bombarded are then 
sealed with Scellofrais R and then grown in the dark at 27°C. The first 
subculturing takes place 24 h later, and then every fifteen days for 

20 3 months on medium identical to the initiating medium supplemented 
with a selective agent. After 3 months, or sometimes earlier, calluses are 
obtained, the growth of which is not inhibited by the selective agent, and 
which are usually and mainly made up of cells resulting from the division 
of a cell which has integrated into its genetic inheritance one or more 

25 copies of the selection gene. The frequency of production of such 
calluses is approximately 0.8 callus per bombarded dish. 

These calluses are identified, individually separated, multiplied, 
and then cultivated so as to regenerate plantlets, by modifying the 
hormone and osmotic balance of the cells according to the method 

30 described by VAIN et al. (1989). These plants are then acclimatized in a 
greenhouse, where they can be crossed so as to obtain hybrids or 
self pollinated. 
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4.2 Transformation with Agrobacterium 

Another transformation technique which can be used in the 
context of the invention employs Agrobacterium tumefaciens, according 
5 to the protocol described ISHIDA et al. (1996), in particular using 
immature embryos of 10 days post-pollination. All the media used are 
referenced in the cited reference. The transformation begins with a 
coculturing phase in which the immature embryos of the maize plants are 
brought into contact for at least 5 min with Agrobacterium tumefaciens 

10 LBA 4404 containing the superbinary vectors. The superbinary plasmid 
is the result of homologous recombination between an intermediate 
vector carrying the T-DNA containing the gene of interest and/or the 
selection marker derived from the plasmids described in the above 
examples, and the vector pSB1 from Japan Tobacco (EP 672 752) which 

15 contains: the virB and virG genes of the plasmid pTiBo542 present in the 
supervirulent A281 strain of Agrobacterium tumefaciens (ATCC 37349) 
and a homologous region found in the intermediate vector which allows 
this homologous recombination. The embryos are then placed on LSA 
medium for 3 days in the dark at 25°C. A first selection is carried out on 

20 the transformed calluses: the embryogenic calluses are transferred onto 
LSD5 medium containing phosphinothricin at 5 mg/l and cefotaxim at 
250 mg/l (elimination or limitation of the contamination with 
Agrobacterium tumefaciens). This step is carried out for 2 weeks in the 
dark and at 25°C. The second selection step is carried out by 

25 transferring the embryos which have developed on LSD5 medium onto 
LSD10 medium (phosphinothricin at 10 mg/l) in the presence of 
cefotaxim, for 3 weeks under the same conditions as previously. The 
third selection step consists in excising the type I calluses which remain 
white (fragments 1 to 2 mm) and in transferring them, for 3 weeks in the 

30 dark and at 25°C, onto LSD 10 medium in the presence of cefotaxim. 

The regeneration of the plantlets is carried out by excising the 
type I calluses which have proliferated and transferring them onto LSZ 
medium in the presence of phosphinothricin at 5 mg/l and cefotaxim, for 
2 weeks at 22°C and under continuous light. 
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The plants which have regenerated are transferred onto 
RM + G2 medium containing 100 mg/l of Augmentin for 2 weeks at 22°C 
and under continuous light, for the development step. The plants 
obtained are then transferred into a phytotron for the purpose of 
5 acclimatizing them. 

EXAMPLE 5 : Biochemical analyses 

Biochemical analyses were carried out on maize grains obtained after 
10 selfpollination of heterozygous plants carrying the emb8516 mutation. 
Thus, the descendants are made up of grains of wild-type phenotype 
with an embryo (either wild-type homozygotes, heterozygotes), or of 
mutant phenotype without an embryo (mutant homozygotes). 

These grains were sorted manually (visual sorting according to the 
15 phenotype) so as to separate those with a mutant phenotype from those 
with a wild-type phenotype. 

The methods used on these grains to measure the various 
parameters (solids content, assaying of crude ash, EWERS assaying of 
20 starch, assaying of water-soluble fats, assaying of amino acids, etc.) may 
be based on the following approved 'NFV or experimental 'XPV AFNOR 
standards (available on the Internet site http://www.afnor.fr , on-line 
standards): 

- Residual solids content (MSR 4H-130): NFV 03-708 March 1976. 
25 Maize: determination of the water content on whole grains and on 

ground grains. The method implemented in the present case does 
not use a refrigerated mill; for this reason, it is not true solids 
content, but simply a residual solids content on the grinding 
intended to be analyzed. 
30 - Assaying of crude ash (ash (2)): NFV 18-101 Oct 1977. Animal 
Feedstuffs Commission: assaying of crude ash. 

- EWERS assaying of starch (Ewers starch (2)): 3rd directive of the 
European Community with corrigendum ECOJ of 11/27/80. 
Assaying of starch, polarimetric method. 
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- Assaying of fats with hydrolysis (MGH (2)): NFV 18-117 August 
1997. Animal feedstuffs. Assaying of fats. Method B. 

- Assaying of walls (walls (2)): XP V 18-111 January 1998. Animal 
feedstuffs. Determination of the content of water-insoluble plant 

5 walls. 

- Assaying of amino acids (amino acids). The methods used are as 
follows: 

XP V 18 113 January 1998. Animal feedstuffs. Assaying of amino 

acids. 

10 XP V 18 114 January 1998. Animal feedstuffs. Assaying of 

tryptophan. 

- Assaying of total nitrogenous materials: DUMAS method: NFV 18- 
120. 

15 The results of the analyses carried out on the homozygous material 
comprising the Emb8516 mutation (Isum2 gene) and the corresponding 
wild-type control are represented in the table below: 



TABLE 12 



The data in the table are expressed in g/kg SC 





Wild-type 
(control) 


Emb8516 


Differential (%) 
/ control 


Solids content (SC) in 
g/Kg ot meai 


888.60 


885.90 


-0.30 


vvaii conienx 


1 \ O.I u 


1 4o.UU 


OA OA 

Z4.Z4 i 


twers siarcn 


0/0.4U 


# 1 y.ou 


ft Hft 
O.UO 


nyaroiyzaDie Tats 


Do.DU 


On 7H 
ZU. / u 


ft7 
-0/\40 


Asn ^rninerai,) 


1 ft 


1 -i ftn 
n \ .ou 


-zo.oo 


Ajinomoto SC in 
g/kg of meal 


902.70 


892.00 


-1.19 


Ajinomoto total 
nitrogenous material 


1 15.87 


107.62 


-7.12 


Lysine 
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Methionine 
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Cysteine 
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I . /4 
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C A O 
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Glycine 


4.43 


3.92 


-11.51 


Histidine 


3.43 


3.14 
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Isoleucine 


4.10 


3.70 


-9.76 


Leucine 


14.29 
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-10.57 


Phenylalanine 


5.76 


5.04 


-12.50 


Serine 


5.65 


4.93 


-12.74 


Tyrosine 


3.77 


3.48 


-7.69 


Valine 


5.65 


5.27 


-6.73 
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Total amino acids 


103.32 


92.81 


-10.17 



These analyses show that there is no difference in terms of the 
solids content of the grains. 

On the other hand, the absence of embryo associated with the 
5 Emb8516 mutation results in a large decrease in the content of 
hydrolyzed fats (less than one third of the value of the control) and of 
mineral ash (decrease of 28.8%), and a slight decrease in total amino 
acids (10.2%). These data confirm the importance of the embryo as a 
source of fats and minerals and, indirectly, the effect of negative 
10 regulation of the expression of the ISUM2 polypeptide on the quality in 
terms of oil of the mature grain. 

These decreases in fats, ash (minerals) and amino acids are, 
moreover, compensated for by an increase in the wall content (24.2%) 
and the starch content (6.1%), which confirms the close relationship 
15 which exists between development of the embryo and development of 
the albumen and the possibility of modulating the agronomic qualities of 
the embryo and/or the albumen as a function of the desired applications 
(semolina production or starch production). 

These experiments therefore confirm the advantage of using 
20 the nucleic acid and polypeptide sequences according to the invention 
involved in the development of the embryo and/or the albumen, for 
modulating the agronomic qualities of the mature grain. 
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TABLE 13 



SEQUENCES SEQ ID No 


Descriotion 


I 
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illilld loUIII^ct 


18 


Primer Isum2b 


19 


Primer Isum2c 


20 


Primer Isum2e 


21 


Primer Isum2k 


22 


Primer Isum2l 


23 


Primer 851 6g 


24 


Primer OMuA 
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Claims 

1. A nucleic acid comprising a polynucleotide encoding an 
ISUM2A polypeptide, chosen from the sequences having at least 95% 
amino acid identity with the sequences SEQ ID No. 5 and SEQ ID No. 6, 

5 or encoding a fragment of at least 100 consecutive amino acids of an 
ISUM2A polypeptide, and also a nucleic acid of complementary 
sequence. 

2. The nucleic acid as claimed in claim 1, comprising a 
10 polynucleotide having at least 95% nucleotide identity with the nucleotide 

sequence SEQ ID No. 1, or with a fragment of at least 300 consecutive 
nucleotides of this nucleotide sequence, or a nucleic acid of 
complementary sequence. 

15 3. The nucleic acid as claimed in claim 1, comprising a 

polynucleotide having at least 95% nucleotide identity with the nucleotide 
sequence SEQ ID No. 2, or with a fragment of at least 300 consecutive 
nucleotides of this nucleotide sequence, or a nucleic acid of 
complementary sequence. 

20 

4. The nucleic acid as claimed in claim 1, comprising a 
polynucleotide having at least 99% nucleotide identity with the nucleotide 
sequence SEQ ID No. 3, or with a fragment of at least 300 consecutive 
nucleotides of this nucleotide sequence, or a nucleic acid of 

25 complementary sequence. 

5. The nucleic acid as claimed in claim 1, comprising a 
polynucleotide having at least 99% nucleotide identity with the nucleotide 
sequence SEQ ID No. 4, or with a fragment of at least 300 consecutive 

30 nucleotides of this nucleotide sequence, or a nucleic acid of 
complementary sequence. 

6. The nucleic acid as claimed in one of claims 1 to 5, 
characterized in that it comprises a regulatory polynucleotide capable of 
35 regulating the transcription or the translation of the polynucleotide 
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encoding the ISUM2A polypeptide, or the fragment of this polypeptide, or 
a nucleic acid of complementary sequence. 

7. The nucleic acid as claimed in claim 6, characterized in that 
5 the regulatory polynucleotide allows overexpression of the ISUM2A 

polypeptide or of the fragment of this polypeptide in a host organism. 

8. The nucleic acid as claimed in either of claims 6 and 7, 
characterized in that the regulatory polynucleotide is sensitive to the 

10 action of an inducer signal. 

9. The nucleic acid as claimed in one of claims 6 to 8, 
characterized in that the regulatory polynucleotide is an inducible 
transcription- or translation-represssing polynucleotide. 

15 

10. The nucleic acid as claimed in one of claims 6 to 8, 
characterized in that the regulatory polynucleotide is an inducible 
transcription- or translation-activating polynucleotide. 

20 11. The nucleic acid as claimed in claim 10, characterized in 

that the inducible activator polynucleotide is a polynucleotide encoding 
the GVG activator, placed under the control of the promoter of the rice 
actin 1 gene. 

25 12. A nucleic acid which can be used as a probe or primer 

which hybridizes specifically with a nucleic acid encoding an ISUM2A 
polypeptide, characterized in that it comprises at least 12 consecutive 
nucleotides of a nucleic acid as claimed in one of claims 1 to 5, or a 
nucleic acid of complementary sequence. 

30 

13. A nucleotide probe or primer which hybridizes specifically 
with a nucleic acid encoding an ISUM2A polypeptide, characterized in 
that it is chosen from the sequences SEQ ID Nos. 10 to 13 and 17 to 23. 
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14. The use of a nucleic acid as claimed in either of claims 12 
and 13, for detecting the presence of a polynucleotide encoding an 
ISUM2A polypeptide in a sample. 

5 15. A recombinant vector comprising a nucleic acid as claimed 

in one of claims 1 to 13. 

16. A host cell transformed with a nucleic acid as claimed in one 
of claims 1 to 13 or with a recombinant vector as claimed in claim 15. 

10 

17. The host cell as claimed in claim 16, characterized in that it is 
a plant cell. 

18. The use of a nucleic acid as claimed in one of claims 1 to 13, 
is of a recombinant vector as claimed in claim 15 or of a transformed host 

cell as claimed in either of claims 16 and 17, for producing a transformed 
plant capable of producing grains with altered germ development. 

19. The use of a nucleic acid as claimed in either of claims 6 and 
20 7, of a recombinant vector comprising such a nucleic acid or of a 

recombinant host cell transfected or transformed with this nucleic acid, 
for obtaining a transformed plant capable of producing grains rich in oil. 

20. A plant transformed with a nucleic acid as claimed in one of 
25 claims 1 to 13 or with a recombinant vector as claimed in claim 15. 

21. A transformed plant comprising a plurality of host cells as 
claimed in claim 17. 

30 22. The transformed plant as claimed in either of claims 20 and 

21 , characterized in that it derives from a plant in which the two copies of 
the lsum2A gene each carry at least one mutation which causes a 
deficiency in the production of an ISUM2A polypeptide of sequence SEQ 
ID No. 5 or 6. 

35 
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23. A method for obtaining a transformed plant capable of 
producing grains with altered germ development, characterized in that it 
comprises the following steps: 

a) transforming at least one plant cell with a nucleic acid as 
5 claimed in one of claims 6 to 10; or with a recombinant vector comprising 

a nucleic acid as claimed in one of claims 6 to 10; 

b) selecting the transformed cells obtained in step a) which have 
integrated into their genome at least one copy of a nucleic acid as 
claimed in one of claims 6 to 10; 

10 

c) regenerating a transformed plant from the transformed cells 
obtained in step b). 

24. The method as claimed in claim 23, in which at least one plant 
15 cell is transformed in step a) with Agrobacterium tumefaciens containing 

a nucleic acid as claimed in one of claims 6 to 10 or a recombinant 
vector comprising a nucleic acid as claimed in one of claims 6 to 10. 

25. A transformed plant, as obtained by the method as claimed in 
20 either of claims 23 and 24. 

26. A hybrid transgenic plant obtained by crossing a plant as 
claimed in claim 25. 

25 27. A part of a transformed plant as claimed in either of claims 25 

and 26. 

28. A method for obtaining plant grains with altered germ 
development, characterized in that it comprises the following steps: 
30 a) cultivating, until pollination, a plant in which the two copies of 

the lsum2A gene each carry at least one mutation causing a deficiency 
in the production of the ISUM2A polypeptide, and into the genome of 
which has been introduced a nucleic acid as claimed in claim 9; 

in the absence of the repressor inducer signal to which the 
35 repressor regulatory polynucleotide is sensitive; 
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b) bringing the transformed plant defined in a) into contact with the 
repressor inducer signal to which the repressor regulatory polynucleotide 
is sensitive, for a period of time ranging from the beginning of pollination 
to the end of grain formation; 

5 c) recovering the mature grains altered in germ development. 

29. A method for obtaining plant grains with altered germ 
development, characterized in that it comprises the following steps: 

a) cultivating, until pollination, a plant in which the two copies of 
10 the lsum2A gene each carry at least one mutation causing a deficiency 
in the production of the ISUM2A polypeptide, and into the genome of 
which has been introduced a nucleic acid as claimed in claim 10; 

in the presence of the activator inducer signal to which the 
activator regulatory polynucleotide is sensitive; 
15 b) continuing the cultivation of the transformed plant defined in a) 

in the absence of the activator inducer signal to which the activator 
regulatory polynucleotide is sensitive, from the period following 
pollination; 

c) recovering the mature grains altered in germ development. 

20 

30. A seed altered in germ development, as obtained by the 
method as claimed in either of claims 28 and 29. 

31 . A seed altered in germ develpment, characterized in that each 
25 of its constitutive cells comprises, in a form artificially integrated into their 

genome, a nucleic acid as claimed in one of claims 6 to 10. 

32. A product of transformation of a seed as claimed in either of 
claims 30 and 31 . 

30 

33. The transformation product as claimed in claim 32, 
characterized in that it is a starch. 

34. The transformation product as claimed in claim 32, 
35 characterized in that it is a seed meal or an oil. 
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35. An ISUM2A polypeptide, or a fragment of this polypeptide, 
encoded by a nucleic acid as claimed in one of claims 1 to 10. 

5 36. The polypeptide as claimed in claim 35, characterized in that it 

has at least 95% amino acid identity with one of the sequences SEQ ID 
No. 5 and SEQ ID No. 6. 

37. An antibody directed against a polypeptide as claimed in either 
10 of claims 35 and 36. 

38. A method for detecting the presence of a polypeptide as 
claimed in either of claims 35 and 36, in a sample, comprising the 
following steps: 

15 a) bringing the test sample into contact with an antibody as 

claimed in claim 37; 

b) detecting the antigen/antibody complex possibly formed. 

39. A pack or kit for detecting a polypeptide as claimed in either of 
20 claims 35 and 36, in a sample, comprising: 

a) an antibody as claimed in claim 37; 

b) where appropriate, one or more reagents required for the 
detection of the antigen/antibody complex. 

25 40. The use of a nucleic acid or of an allelic variant of a nucleic 

acid as claimed in one of claims 1 to 5, in selection programs for 
obtaining plants with modified embyro size and/or development 
influencing the content of starch and/or of oil. 

30 41. A method for selecting plants with modified embryo size 

and/or development, comprising the steps of: 

a) genotyping plants (individuals) using nucleotide probes or 
primers obtained from the nucleic acids as claimed in one of claims 1 to 
5 or variants of these nucleic acids; 
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b) selecting, from these plants (individuals), those which 
comprise a high frequency of favorable alleles associated with the size 
and/or development of the embryo. 



